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POLYPEPTIDES OF THE "BASIC HELIX-LOOP-HELIX" BHLH 
FAMILY, CORRESPONDING NUCLEIC ACID SEQUENCES 

RELATED APPLICATIONS 

5 This application is a Continuation-in-Part of application 09/331,356 filed 

July 12, 1999, which is a 35 USC §371 application of PCT/FR97/02368 filed 
December 19, 1997. This application claims the benefit of foreign priority under 
35 U.S.C. § 1 19(a) to application FR96/15651 filed in France on December 19, 
1996. 
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FIELD OF THE INVENTION 

The present invention relates to new bHLH-type polypeptides and to their 
corresponding coding nucleic acids. It also relates to expression vectors integrating 
said nucleic acids and to the use of these nucleic acids, vectors, or polypeptides for 
1 5 therapeutic purposes. 

BACKGROUND OF THE INVENTION 

The embryonic neuronal tube is at the outset composed of an 
undifferentiated neuroepithehum over its entire length. The cells then differentiate, 

2 0 depending on their positions on the anterior/posterior, dorsal/ventral and 

medium/lateral axes thereby generating various neuronal structures. This 
differentiation is in part controlled by genetic factors that subdivide the neuronal 
tube into several histogenic regions. It is now known that numerous transcriptional 
factors and, more particularly, products encoded by the so-called bHLH "Basic 
25 Helix Loop Helix" genes (Gaudy et aL, 1988, Cell, 55, 1061-67) are involved in 
this differentiation phenomenon. The proteins of this family have highly conserved 
amino acid units and more particularly a basic domain followed by Helix 1 and 
Helix 2. The latter two units are separated by a loop of variable size. In vertebrates, 
certain products of these bHLH genes participate in the various stages of 

3 0 neurogenesis during neuronal determination and differentiation. By way of 
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representatives of these proteins, there may be mentioned in particular the Math-1 
(Akazawa et al., 1995, J. Bol. Chem., 279,8730-38), Mash-1 (Johnson et al, 1990, 
346, 858-61), and NeuroD (Lee et al., 1995, Sciences, 268, 836-44) proteins. 
Evidently, other proteins comprising this bHLH domain are involved in neuronal 
5 development at the level of other sites and at variable times. The identification of 
new bHLH-type proteins would be particularly valuable for increasing our 
understanding of neurogenesis and therefore be advantageous from the therapeutic 
point of view- 
Members of the basic helix loop helix (bHLH) transcription factor family 
1 0 play a key role in a wide variety of cell fate determination and specification 
processes. These processes are as diverse as cardial muscle development^ 

2 • * 3 

skeletal development , mesodermal cell determination , dermal cell 
differentiation"^ and more particularly, myogenesis^ and neurogenesis^. 

Our understanding of the molecular basis of vertebrate nervous system 
15 development has benefitted fi:om the identification of bHLH factors analogous to 
those regulating neurogenesis in Drosophila, Li the fly, proneural bHLH genes, 
such as the four genes of the achaete-scute complex {AS-C) and the atonal gene 
are required for the fate determination of neuronal precursors within the 
ectoderm^' ^. The expression of proneural genes in Drosophila is restricted to the 
2 0 neural precursor cells by lateral inhibition mediated by the Notch signaling 
pathway^' ^. The search for vertebrate homologs of the Drosophila proneural 
genes has led to the identification of numerous genes encoding bHLH proteins. 
The AS'C homolog Mashl, the atonal homolog Mathl and the most recently 
identified genes of the neurogenin (ngn) family, including ngnl/Math4C, 

2 5 ngn2/Math4A and ngn2/Math4B/Relax have temporal and spatial expression 

patterns consistent with a fimction in neuronal determination^^. The three 
neurogenin genes encode basic helix loop helix transcription factors. Functional 
analysis of these murine genes has shown that they play a key role in the 
determination of neuronal cell fate. In contrast, several other genes of the 

3 0 NeuroD family, including NeuroD/Betal^ Math2/Nexl, Math3 are more likely to 
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be involved in differentiation than determination^^. Functional analysis of these 
vertebrate genes has clearly suggested that the overall mechanism controlling 
neurogenesis is conserved throughout evolution at the fate determination and 
specification levels. The precise role in the developing nervous system of the 
5 putative neuronal determination genes has been investigated by gain-of- function 
and loss-of-function studies. The overexpression in Xenopus or Zebrafish 
embryos of some of the latter genes converts non-neural ectoderm into neurons 
suggesting that these genes encode neuronal determination factors^ Analysis 
of null mutations of Mashl, ngnl and ngn2 has demonstrated that these genes 
1 0 have true determination functions affecting the development of at least a subset 
of neuronal cells^^'^^. 

SUMMARY OF THE INVENTION 

The subject of the present invention is precisely the isolation, sequencing 
15 and characterization of cDNAs encoding polypeptides that belong to a new family 
of bHLH-type proteins. It also describes recombinant DNAs incorporating these 
nucleic acids, vectors containing them, and their use for activating the expression 
of recombinant proteins. 

Apphcants have now identified polypeptides possessing a high homology 
2 0 with the bHLH proteins already identified, a homology that is limited to the bHLH 
domain. Indeed, unexpectedly, the regions bordering this bHLH domain share no 
amino acid sequence identity with those of other bHLH-type proteins. 

Consequently, the first subject of the present invention is a bHLH-type 
polypeptide endowed with a transcriptional activity, characterized in that it shares 

2 5 amino acid sequence homology with the bHLH-type proteins only at the level of its 

bHLH domain. 

More precisely, the present invention relates to a polypeptide possessing a 
transcriptional activity, characterized in that it comprises completely or partly an 
amino acid sequence of either one of SEQ ID NOs: 8 or 10, or a derivative thereof 

3 0 Preferably, it is a bHLH-type polypeptide comprising an amino acid 
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sequence represented in either one of SEQ ID NOs: 8 or 10. In a preferred 
embodiment, the bHLH-type polypeptide comprises 214 amino acids (SEQ ID NO: 
8), for which the expression of the corresponding mRNA is detected in rat embryo 
in the form of longitudinal bands. This protein is designated hereinafter by the 
5 name Relax for "Rat Embryonic Longitudinal Axis" and constitutes a polypeptide 
according to the invention. 

Comparison of the amino acid sequence of Relax with those of already 
available proteins has made it possible to establish a sequence correlation 
essentially with the bHLH-type proteins. Moreover, this homology has been 
1 0 located only at the level of the bHLH domain. The bHLH domain of Relax thus 
shares respectively 68%, 57% and 40% homology with the bHLH domains of the 
NeuroD, Math-1 and Mash-1 proteins. 

From tests of activity carried out with the Relax protein and more precisely 
discussed in the examples below, it is evident that this protein is capable of binding 
15 specifically to an E box sequence and that in cotransfection trials, it behaves like a 
transcriptional activator and that this activity is strictly dependent on the presence 
of an intact E box. All these observations demonstrate the membership of the 
claimed Relax protein of a new family of bHLH proteins. 

In another preferred embodiment, the bHLH-type polypeptide comprises 
20 214 amino acids (SEQ ID NO: 10), which is designated hereinafter as human 
neurogenin 3 (hngnS) and constitutes a polypeptide according to the invention. 

The subject of the present invention is also a nucleic acid that encodes a 
bHLH-type polypeptide according to the invention. 

More precisely, the present invention relates to a nucleic acid comprising a 

2 5 polynucleotide sequence, characterized in that it is represented completely or partly 

by SEQ ID NOs: 1 or 9, or a complementary polynucleotide sequence or derivative 
thereof. 

The claimed nucleic acids may be complementary DNA (cDNA), genomic 
DNA (gDNA) or RNA fragments. They may be more particularly cDNA or gDNA, 

3 0 More preferably, it is a nucleic acid comprising a polynucleotide sequence 
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of either one of SEQ ID NOs: 1 or 9, or of a complementary polynucleotide 
sequence thereof. 

The present invention is also intended to cover corresponding anti-sense 
5 nucleic acids whose expression makes it possible to control the transcription of 
cellular mRNAs, Such nucleic acids may comprise all or part of a nucleic acid 
polynucleotide sequence according to the invention, transcribed in the reverse 
orientation. They are complementary or significantly complementary to the 
polynucleotide sequence of a nucleic acid according to the invention, either in total 
10 or in part. 

The subject of the present invention is also the use of the polypeptides of 
the invention to control and/or participate in the expression of genes. These 
transcriptional activators may be most particularly advantageous for targeting the 
expression of a protein at the level of the central nervous system. 

1 5 The polypeptides of the invention may be obtained by expressing, in a 

cellular host, a nucleic acid as described above, incorporated or otherwise into a 
recombinant DNA, using the techniques known to persons skilled in the art, or by a 
combination of these techniques. 

Thus, the invention relates to a method for the production of a 

2 0 polypeptide according to the invention, wherein said method comprises the steps 
of: 

a) inserting a nucleic acid encoding said polypeptide into an appropriate 

vector; 

b) culturing, in an appropriate culture medium, a previously transformed 

2 5 host cell or transfecting a host cell with the recombinant vector of step a); 

c) recovering the conditioned culture medium or lysing the host cell, for 
example by sonication or by osmotic shock; 

d) separating and purifying said polypeptide from said culture medium or 
altematively from the cell lysates obtained in step c); and 

3 0 e) where appropriate, characterizing the recombinant polypeptide produced. 
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Preferably, the nucleic acids according to the invention form part of a 
vector useful for inducing in vivo, ex vivo and/or in vitro the expression of the 
claimed polypeptides. The vector used may be of various origins, provided that it is 
capable of transforming animal cells, preferably human nerve cells. In a preferred 
5 embodiment of the invention, a viral vector, which may be derived jfrom 

adenoviruses, retroviruses, adeno-associated viruses (AAV), the herpesvirus, the 
cytomegalovirus (CMV), the vaccinia virus, and the like, is used. 

The present invention therefore also relates to any recombinant virus 
comprising, inserted into its genome, a nucleic acid according to the invention. 
1 0 Preferably, the recombinant virus according to the invention is a replication 
defective virus. 

It is particularly advantageous to use the nucleic sequences of the invention 
in a form incorporated into an adenovirus, an AAV or a recombinant retrovirus that 
is defective. According to a preferred embodiment, it is an adenovirus. The vector, 
1 5 nucleic acid or polypeptide of the invention can be in an isolated or purified form. 

The subject of the present invention is also a pharmaceutical composition 
comprising at least one nucleic acid, vector or polypeptide according to the 
invention and a pharmaceutically compatible excipient. 

It also relates to any use of a nucleic acid, recombinant vector, or 
2 0 polypeptide according to the invention for the manufacture or preparation of a 

pharmaceutical composition or medicament intended for the treatment of a disease, 
condition, or pathology affecting the nervous system. 

The present invention also provides nucleotide probes and primers that 
hybridize vv^ith a nucleic acid sequence of a nucleic acid according to the 
2 5 invention. The nucleotide probes or primers according to the invention comprise 
at least 8 consecutive nucleotides of a nucleic acid comprising a polynucleotide 
sequence of either one of SEQ ID NOs: 1 or 9, or of a complementary 
polynucleotide sequence. Preferably, nucleotide probes or primers according to 
the invention will have a length of at least 10, 12, 15, 18, 20 to 25, 35, 40, 50, 
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70, 80, 100, 200, 500, 1000, or 1500 consecutive nucleotides of a nucleic acid 
according to the invention. 

The present invention also relates to a method of amplifying a nucleic acid 
according to the invention contained in a sample, wherein said method comprises 
5 the steps of: 

a) bringing the sample in which the presence of the target nucleic acid is 
suspected into contact with a pair of nucleotide primers whose hybridization 
position is located respectively on the 5 'side and on the 3 'side of the region of the 

1 0 target nucleic acid whose amplification is sought, in the presence of the reagents 
necessary for the amplification reaction; and 

b) detecting the amplified nucleic acids. 

The present invention also relates to a method of detecting the presence of a 
nucleic acid according to the invention in a sample, wherein said method comprises 
15 the steps of: 

1) bringing one or more nucleotide probes according to the invention into 
contact with the sample to be tested; 

2) detecting the complex which may have formed between the probe(s) and 
the nucleic acid present in the sample. 

2 0 Another subject of the invention is a box or kit for amplifying all or part of 

a nucleic acid according to the invention, wherein said box or kit comprises: 

1) a pair of nucleotide primers in accordance with the invention, whose 
hybridization position is located respectively on the 5 'side and 3 'side of the 
target nucleic acid whose amplification is sought; and optionally, 

2 5 2) reagents necessary for an amplification reaction. 

The invention also relates to a box or kit for detecting the presence of a 
nucleic acid according to the invention in a sample, said box or kit comprising: 

a) one or more nucleotide probes according to the invention; and 

b) where appropriate, reagents necessary for a hybridization reaction. 

3 0 The present invention also relates to a method of screening for antagonists 
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or agonists of the polypeptides according to the invention. Preferably, the method 
of screening for antagonists or agonists of the polypeptides according to the 
invention is directed toward identifying an antagonist or agonist of a polypeptide 
comprising an amino acid sequence of either one of SEQ ID NO: 8 or SEQ ID NO: 
5 10. 

In a specific embodiment, the present invention relates to a method of 
identifying a modulator, agonist, or antagonist of a polypeptide according to the 
invention in a sample comprising 

a) obtaining a cell, for example a cell line, that, either naturally or after 
1 0 transfecting the cell with a nucleic acid encoding a polypeptide according to the 

invention, expresses a polypeptide according to the invention, 

b) transfecting the cell with a nucleic acid encoding a marker gene, such as 
p-galactosidase (p-gal), 

c) incubating the cell of step b) with a test solution or sample comprising a 
15 potential modulator, agonist or antagonist, 

d) measuring the amount of p-galactosidase activity, and 

e) comparing the amount of p-galactosidase activity measured in step d) 
with an amount of p-galactosidase activity measured with a cell that has not been 
previously incubated in the presence of the candidate modulator, agonist, or 

2 0 antagonist compoimd for the polypeptide according to the invention. 

The present invention also relates to an antibody directed against a 
polypeptide according to the invention. Preferably, the antibody of the invention is 
directed against a polypeptide comprising an amino acid sequence of either one of 
SEQ ID NO: 8 or SEQ ID NO: 10. 

2 5 The present invention relates to a method of detecting the presence of a 

polypeptide of the invention in a sample comprising the polypeptide. The present 
invention also relates to a kit for detecting the presence of a polypeptide of the 
invention in a sample comprising the polypeptide. 

The present invention relates to a method of detecting the presence of a 

3 0 nucleic acid according to the invention in a sample comprising the nucleic acid. 
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The present invention also relates to a kit for detecting the presence of a nucleic 
acid according to the invention in a sample comprising the nucleic acid. 

The present invention also relates to a composition comprising a nucleic 
acid encoding a polypeptide according to the invention, wherein the nucleic acid is 
placed under the control of appropriate regulatory elements. 

Thus, the present invention also relates to a pharmaceutical composition 
comprising a nucleic acid, polypeptide, or recombinant vector according to the 
invention, combined with one or more physiologically compatible vehicles and/or 
excipients. 

The invention relates to an isolated recombinant host cell comprising a 

nucleic acid of the invention. 

According to another aspect, the invention also relates to an isolated 
recombinant host cell comprising a recombinant vector according to the invention. 
Therefore, the invention also relates to a recombinant host cell comprising a 
recombinant vector comprising a nucleic acid of the invention. 

The present invention also relates to the use of isolated cells genetically 
modified ex vivo with a nucleic acid or recombinant vector according to the 
invention, or of isolated cells producing a recombinant vector, wherein the cells are 
implanted in the body, to allow a prolonged and effective expression in vivo of a 
biologically active polypeptide according to the invention. 

Thus, the invention also relates to the use of a recombinant host cell 
according to the invention, comprising a nucleic acid encoding a polypeptide 
according to the invention for the manufacture or preparation of a pharmaceutical 
composition or medicament intended for the prevention of, or more particularly, 
for the treatment of subjects affected by a nervous system pathology, dysfunction, 
disorder or condition. 

The invention relates to the use of a defective recombinant virus according 
to the invention for the preparation of a pharmaceutical composition intended for 
the treatment and/or for the prevention of a disorder or condition affecting the 
nervous system. Thus, the present invention also relates to a pharmaceutical 
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composition comprising a defective recombinant viras according to the invention, 
combined with one or more physiologically compatible vehicles and/or excipients. 

The present invention also relates to the use of recombinant host cells 
genetically modified ex vivo with a recombinant defective virus according to the 
invention, or of recombinant host cells producing such viruses, implanted in the 
body, allowing a prolonged and effective expression in vivo of a biologically active 
protein of the invention. A specific embodiment of the invention is an isolated 
mammalian cell infected with one or more defective recombinant viruses according 
to the invention. 

Another subject of the invention relates to an implant comprising isolated 
mammalian cells infected with one or more defective recombinant viruses 
according to the invention or cells producing recombinant viruses, and an 
extracellular matrix. More particularly, in the implants of the invention, the 
extracellular matrix comprises a gelling compound and optionally, a support 
allowing the anchorage of the cells. 

Finally, the invention extends, in addition, to any use of a polypeptide, 
nucleic acid, recombinant vector, recombinant virus, recombinant host cell, or 
implant according to the invention to prevent and/or treat a dysfimction, disease, 
condition, or pathology affecting the nervous system or to control, influence, 
and/or participate in the expression of a gene. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1: Nucleotide sequence of the cDNA CIG235. The various ohgonucleotides 
used to isolate the complete cDNA are indicated on this sequence. The arrows 
indicate the 5* to 3* orientation of these oligonucleotides. 

Figure 2: Comparison of the amino acid sequences of the bHLH domain of Relax 
with other proteins of the family. The homologous amino acids are noted in black, 
and the amino acids of similar structure in grey. 

Figure 3: Sequence of the hngn3 gene. The hngnS coding region is indicated by 
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capital letters. The deduced amino acid sequence is presented above the 
nucleotidic sequence. Both nucleotide and amino acid numbers are indicated on 
the left. The 138 bp PGR product ampUfied from the basic and hehx n domain is 
underlined. The black arrow marks the 5' end of the 3 '-anchored PGR product 

5 whereas the gray arrow indicates the 3' end of the 5 '-anchored PGR product. 
The intron sequence interrupting the 5' untranslated region is boxed in gray. 

Figure 4: The amino acid sequences encoded by hngn3, ngnS^ relax, ngnl, ngn2 
and neuroDS genes were aligned using the Glustal-X program. The identity 
between the amino acid residues is displayed in color according to Glustal-X 
1 0 program parameters, (*) indicates residues conserved in all sequences. 

Figure 5: Optimal alignments obtained with the Glustal-X program were 
subjected to phylogenetic analysis with distance methods (Neighbor- Joining). 
The resulting trees correspond to the alignment of the bHLH domains only (a) or 
of the entire amino acid sequences (b). Bootstrap values are indicated near each 

1 5 branch, (a) Five paralogous groups highlighted on the right were clearly 

identified. This result is supported by the bootstrap value boxed in gray, (b) The 
phylogenetic relationships within the neurogenin family define three paralogous 
groups. All knovra members of the neurogenin are represented in this tree. Gene 
names according to the original publications are indicated in capital letters. The 

2 0 species from which the genes were isolated are indicated as a lower case prefix, 
dm ^ Drosophila melanogaster; ce = Caenorhabditis elegans; dr = Danio rerio; 
xl = Xenopus laevis; gg ^ Gallus gallus; mm = Mus musculus; m = Rattus 
norvegicus; hs ^ Homo sapiens, Genebank accession numbers are as follows: 
dmDA = J03148; dmTS - X52892; dm T5 = M17120; dmT4 = M171 19; dmT3 

25 - X12549; ceLIN32 - U15418; dmATONAL - L36646; drATHl - AF024536; 
ggGATHl = U61149; mmMATHl = D43694; hsHATHl = U61 148; mmMath3 
= D85845; drNEUROD = AF036148; mmMATH2 - D44480; mmNEUROD2 = 
U58771; mmNEUROD = U28068; hsNEUROD = D82347; dmTAP = X95845; 
xlNGNRl A = U67778; mmNGN2 = U76207; ggNGN2 = AJ012659; drNGNl = 
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AF017301; ggNGNl = AJ012660; mmNGNl = U67776; mNGNl = U67777; 
hsNEUROD3 = U63842; nmiNGNB - U76208; mRELAX = Y10619; 
hsHNGNS = AJl 33776; mniTWIST - M63649; mmSCLERAXIS - S78079; 
mmPARAXIS = U18658; mmEHAND = S79216; mmDHAND = U40039. 

5 Figure 6: The chromosomal location of hngnS was determined by radiation 
hybrid mapping. Linkage analysis mapped hngn3 to 10q21.3, between the 
D10S1418 (GDB 684330) and D10S1603 (GDB 589266) markers (boxed in 
gray). A comprehensive map of chromosome 10 is shown on the left-hand side 
and a fine map of the region containing hngn3 is shown on the right-hand side. 

1 0 The maps were obtained using the mapview program of the genome data base 
(GDB) package. The name of each gene and marker and their GDB accession 
numbers are as follow: D10S1418 (GDB 684330); D10S1603 (GDB 589266); 
CHAT choline acetyltransferase (GDB 1 19775); PAX2 = paired box homeotic 
gene 2 (GDB 138771), ANX7 = annexin VH (GDB 369042 ); PSAP = 

15 prosaponin (GDB 120366); PRGl = platelet proteoglycan 1 protein core (GDB 
120312); GLUDP3 = glutamate dehydrogenase pseudogene 3 (GDB 137196); 
ANK3 = ankyrin 3 (GDB 424503); DNA2L = DNA rephcation heUcase (yeast 
homolog) -like (GDB 1313702); MATIA = methionine adenosyltransferase I 
alpha (GDB 129077); CAMKG = CaM kinase H gamma (GDB 138469); HKl = 

2 0 hexokinase 1 (GDB 120044). 

Figure 7: Transverse sections of human embryos at Carnegie stage 18 were probed 
with an hngnS antisense riboprobe (a), (c)-(h) and a sense probe (b). In the spinal 
cord (a), (b), (e)-(h) hngnS -positive cells are present in two longitudinal stripes (a) 
and (e). The ventral stripe (full arrow) is adjacent to the floor plate (fp). The medial 
25 stripe (arrow head) is located around the midpoint of the dorso-ventral axis. Both 
stripes are contained within the ventricular zone as shown by phase contrast 
microscopy (f). (g) high magnification of the ventral stripe, (h) high magnification 
of the medial stripe. hngnS is expressed in scattered cells throughout the pancreas 
as shown on the transverse section of the left lobe 

3 0 of the pancreas. At higher magnification, the hngn3 -positive cells appear to be 
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organized 

as clusters (d). Scale bars (a) and (b) 200^m; (c) lOOjim; (e) and (f) 50^m; (d), (g) 
and (h) 25 [am. 

5 DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

The following defined terms are used throughout the present specification, 
and should be helpfiil in xmderstanding the scope and practice of the present 
10 invention. 

hi a specific embodiment, the term "about" or "approximately" means 
within 20%, preferably within 10%, and more preferably within 5% of a given 
value or range. 

A "nucleic acid" is a polymeric compoimd comprised of covalently linked 
1 5 subimits called nucleotides. Nucleic acid includes polyribonucleic acid (RNA) and 
polydeoxyribonucleic acid (DNA), both of which may be single-stranded or 
double-stranded. DNA includes cDNA, genomic DNA, synthetic DNA, and semi- 
synthetic DNA. 

A "gene" refers to an assembly of nucleotides that encode a polypeptide, 
2 0 and includes cDNA and genomic DNA nucleic acids. 

A "recombinant DNA molecule" is a DNA molecule that has undergone a 
molecular biological manipulation. 

A "vector" is any means for the transfer of a nucleic acid into a host cell. A 
vector may be a repHcon to which another DNA segment may be attached so as to 

2 5 bring about the replication of the attached segment. A "replicon" is any genetic 

element (e.g., plasmid, phage, cosmid, chromosome, virus) that functions as an 
autonomous unit of DNA replication in vivo, i.e., capable of replication under its 
own control. The term "vector" includes both viral and nonviral means for 
introducing the nucleic acid into a cell in vitro, ex vivo or in vivo. Viral vectors 

3 0 include retrovirus, adeno-associated virus, pox, baculovirus, vaccinia, herpes 
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simplex, Epstein-Barr and adenovirus vectors, as set forth in greater detail below. 
Non-viral vectors include plasmids, liposomes, electrically charged lipids 
(cytofectins), DNA-protein complexes, and biopolymers. hi addition to a nucleic 
acid, a vector may also contain one or more regulatory regions, and/or selectable 
markers useful in selecting, measuring, and monitoring nucleic acid transfer results 
(transfer to which tissues, duration of expression, etc.). 

A "cloning vector" is a replicon, such as plasmid, phage or cosmid, to 
which another DNA segment may be attached so as to bring about the replication 
of the attached segment. Cloning vectors may be capable of replication in one cell 
type, and expression in another ("shuttle vector"). 

A "cassette" refers to a segment of DNA that can be inserted into a vector at 
specific restriction sites. The segment of DNA encodes a polypeptide of interest, 
and the cassette and restriction sites are designed to ensure insertion of the cassette 
in the proper reading frame for transcription and translation. 

A cell has been "transfected" by exogenous or heterologous DNA when 
such DNA has been introduced inside the cell. A cell has been "transformed" by 
exogenous or heterologous DNA when the transfected DNA effects a phenotypic 
change. The transforming DNA can be integrated (covalently linked) into 
chromosomal DNA making up the genome of the cell. 

A "nucleic acid molecule" refers to the phosphate ester polymeric form of 
ribonucleosides (adenosine, guanosine, uridine or cytidine; "RNA molecules") or 
deoxyribonucleosides (deoxyadenosine, deoxyguanosine, deoxj^ymidine, or 
deoxycytidine; "DNA molecules"), or any phosphoester anologs thereof, such as 
phosphorothioates and thioesters, in either single stranded form, or a double- 
stranded helix. Double stranded DNA-DNA, DNA-RNA and RNA-RNA helices 
are possible. The term nucleic acid molecule, and in particular DNA or RNA 
molecule, refers only to the primary and secondary structure of the molecule, and 
does not limit it to any particular tertiary forms. Thus, this term includes double- 
stranded DNA found, inter alia, in linear or circular DNA molecules {e.g., 
restriction fragments), plasmids, and chromosomes. In discussing the structure of 
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particular double-stranded DNA molecules, sequences may be described herein 
according to the normal convention of giving only the sequence in the 5' to 3' 
direction along the nontranscribed strand of DNA (ie., the strand having a 
sequence homologous to the mRNA). A "recombinant DNA molecule" is a DNA 
molecule that has undergone a molecular biological manipulation. 

For the purposes of the present invention, the term "derivative" designates 
any sequence differing from the sequence considered as a resuh of a degeneracy of 
the genetic code, obtained by one or more modifications of a genetic and/or 
chemical nature, as well as any sequence that hybridizes with these sequences or 
fragments thereof and whose product possesses the activity indicated. 

Modification of a genetic and/or chemical nature is xmderstood to mean any 
mutation, substitution, deletion, addition and/or modification of one or more 
residues. Such derivatives maybe generated for various pxnposes, such as in 
particular that of enhancing its production levels that of increasing and/or 
modifying its activity, or that of conferring new pharmacokinetic and/or biological 
properties on it. Among the derivatives resulting from an addition, there may be 
mentioned, for example, the chimeric nucleic sequences comprising an additional 
heterologous part linked to one end, for example of the hybrid construct type 
consisting of a cDNA with which one or more introns would be associated. 

Likewise, for the purposes of the invention, the claimed nucleic acids may 
comprise promoter, activating or regulatory sequences, and the like. 

A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, 
such as a cDNA, genomic DNA, or RNA, when a single stranded form of the 
nucleic acid molecule can anneal to the other nucleic acid molecule under the 
appropriate conditions of temperature and solution ionic strength (see Sambrook et 
al., supra). The conditions of temperature and ionic strength determine the 
"stringency" of the hybridization. For preliminary screening for homologous 
nucleic acids, low stringency hybridization conditions, corresponding to a Tm of 
55°, can be used, e.g., 5x SSC, 0.1% SDS, 0.25% milk, and no formamide; or 30% 
formamide, 5x SSC, 0.5% SDS). Moderate stringency hybridization conditions 
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correspond to a higher Tm, e.g., 40% formamide, with 5x or 6x SCC. High 
stringency hybridization conditions correspond to the highest Tm, e.g., 50% 
formamide, 5x or 6x SCC. Hybridization requires that the two nucleic acids 
contain complementary sequences, although depending on the stringency of the 
hybridization, mismatches between bases are possible. The appropriate stringency 
for hybridizing nucleic acids depends on the length of the nucleic acids and the 
degree of complementation, variables well known in the art. The greater the degree 
of similarity or homology between two nucleotide sequences, the greater the value 
of Tm for hybrids of nucleic acids having those sequences. The relative stability 
(corresponding to higher Tm) of nucleic acid hybridizations decreases in the 
following order: RNArRNA, DNA:RNA, DNA:DNA. For hybrids of greater than 
100 nucleotides in length, equations for calculating Tm have been derived {see 
Sambrook et aL, supra, 9.50-0.51). For hybridization with shorter nucleic acids, 
i.e., oligonucleotides, the position of mismatches becomes more important, and the 
length of the oligonucleotide determines its specificity {see Sambrook et al., supra, 
1 1 .7-11.8). Preferably a minimum length for a hybridizable nucleic acid is at least 
about 10 nucleotides; preferably at least about 15 nucleotides; and more preferably 
the length is at least about 20 nucleotides. 

In a specific embodiment, the term "standard hybridization conditions" 
refers to a Tm of 55°C, and utilizes conditions as set forth above. In a preferred 
embodiment, the Tm is 60°C; in a more preferred embodiment, the Tm is 65''C, 

"High stringency hybridization conditions" for the purposes of the present 
invention will be understood to mean the following conditions: 
1- Membrane competition and PREHYBRIDIZATION : 

- Mix: 40 \i\ salmon sperm DNA (10 mg/ml) 

+ 40 \i\ human placental DNA (10 mg/ml) 

- Denature for 5 minutes at 96°C, then immerse the mixture in ice. 

- Remove the 2X SSC and pour 4 ml of formamide mix in the hybridization tube 
containing the membranes. 

- Add the mixture of the two denatured DNAs. 
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- Incubation at 42'*C for 5 to 6 hours, with rotation. 

2- Labeled probe competition : 

- Add to the labeled and purified probe 10 to 50 |li1 Cot I DNA, depending on the 
quantity of repeats. 

- Denature for 7 to 10 minutes at 95°C. 

- Incubate at 65°C for 2 to 5 hours. 

3- HYBRIDIZATION : 

- Remove the prehybridization mix. 

- Mix 40 [il salmon sperm DNA + 40 iil human placental DNA; denature for 5 min 
at 96°C, then immerse in ice. 

- Add to the hybridization tube 4 ml of formamide mix, the mixture of the two 
DNAs and the denatured labeled probe/Cot I DNA . 

- Incubate 15 to 20 hours at 42°C, with rotation. 

4- Washes and Exposure : 

- One wash at room temperature in 2X SSC, to rinse. 

- Twice 5 minutes at room temperature 2X SSC and 0.1% SDS at 65°C. 

- Twice 15 minutes at 65^C IX SSC and 0.1% SDS at 65°C. 

- Envelope the membranes in clear plastic wrap and expose. 

The hybridization conditions described above are adapted to hybridization, 
under high stringency conditions, of a molecule of nucleic acid of varying length 
from 20 nucleotides to several hundreds of nucleotides. It goes without saying that 
the hybridization conditions described above may be adjusted as a function of the 
length of the nucleic acid whose hybridization is sought or of the type of labeling 
chosen, according to techniques known to one skilled in the art. Suitable 
hybridization conditions may, for example, be adjusted according to the teaching 
contained in Hames and Higgins, 1985 (Nucleic acid hybridization : a practical 
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approach, Hames and Higgins Ed., IRL Press, Oxford) and Ausubel et al., 1989 
(Current Protocols in Molecular Biology, Green Publishing Associates and Wiley 
Interscience, N.Y.)* 

As used herein, the term "oligonucleotide" refers to a nucleic acid, 
generally of at least 18 nucleotides, that is hybridizable to a genomic DNA 
molecule, a cDNA molecule, or an mRNA molecule encoding a polypeptide of the 

32 

invention. Oligonucleotides can be labeled, e.g., with P-nucleotides or 
nucleotides to which a label, such as biotin, has been covalently conjugated, hi one 
embodiment, a labeled ohgonucleotide can be used as a probe to detect the 
presence of a nucleic acid encoding a polypeptide of the invention. In another 
embodiment, ohgonucleotides (one or both of which may be labeled) can be used 
as PGR primers, either for cloning full length or a fragment of a polypeptide of the 
invention, or to detect the presence of nucleic acids encoding a polypeptide of the 
invention. In a further embodiment, an oligonucleotide of the invention can form a 
triple helix with a DNA molecule of the invention. Generally, ohgonucleotides are 
prepared synthetically, preferably on a nucleic acid synthesizer. Accordingly, 
oligonucleotides can be prepared with non-naturally occurring phosphoester analog 
bonds, such as thioester bonds, etc. 

A DNA "coding sequence" is a double-stranded DNA sequence that is 
transcribed and translated into a polypeptide in a cell in vitro or in vivo when 
placed under the control of appropriate regulatory sequences. The boundaries of 
the coding sequence are determined by a start codon at the 5' (amino) terminus and 
a translation stop codon at the 3' (carboxyl) terminus. A coding sequence can 
include, but is not limited to, prokaryotic sequences, cDNA from eukaryotic 
mRNA, genomic DNA sequences from eukaryotic (e.g., mammahan) DNA, and 
even synthetic DNA sequences. If the coding sequence is intended for expression 
in a eukaryotic cell, a polyadenylation signal and transcription termination 
sequence will usually be located 3' to the coding sequence. 

Transcriptional and translational control sequences are DNA regulatory 
sequences, such as promoters, enhancers, terminators, and the like, that provide for 
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the expression of a coding sequence in a host cell, hi eukaryotic cells, 
polyadenylation signals are control sequences. 

A "promoter sequence" is a DNA regulatory region capable of binding 
RNA polymerase in a cell and initiating transcription of a downstream (3' 
5 direction) coding sequence. For purposes of defmuig the present invention, the 
promoter sequence is bounded at its 3' terminus by the transcription initiation site 
and extends upstream (5' direction) to include the minimum number of bases or 
elements necessary to initiate transcription at levels detectable above background. 
Within the promoter sequence will be found a transcription initiation site 
1 0 (conveniently defined for example, by mapping with nuclease S 1), as well as 
protein binding domains (consensus sequences) responsible for the binding of 
RNA polymerase. 

A coding sequence is "under the control" of transcriptional and translational 
control sequences in a cell when RNA polymerase transcribes the coding sequence 
15 into mRNA, which is then trans-RNA spliced (if the coding sequence contains 
introns) and translated into the protein encoded by the coding sequence. 

As used herein, the term "homologous" in all its grammatical forms and 
spelling variations refers to the relationship between proteins that possess a 
"common evolutionary origin," including proteins fi*om superfamihes (e.g., the 
2 0 immunoglobulin superfamily) and homologous proteins from different species 
(e.g., myosin light chain, etc.) (Reeck et ah, 1987, Cell 50:667). Such proteins 
(and their encoding genes) have sequence homology, as reflected by their high 
degree of sequence similarity. 

Accordingly, the term "sequence similarity" in all its grammatical forms 

2 5 refers to the degree of identity or correspondence between nucleic acid or amino 

acid sequences of proteins that may or may not share a common evolutionary 
origin {see Reeck et al., supra). However, in common usage and in the instant 
apphcation, the term "homologous," when modified with an adverb such as 
"highly," may refer to sequence similarity and not a common evolutionary origin. 

3 0 In a specific embodiment, two DNA sequences are "substantially 
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homologous" or "substantially similar" when at least about 50% (preferably at least 
about 75%, and most preferably at least about 90 or 95%) of the nucleotides match 
over the defined length of the DNA sequences. Sequences that are substantially 
homologous can be identified by comparing the sequences using standard software 
available in sequence data banks, or in a Southern hybridization experiment under, 
for example, stringent conditions as defined for that particular system. Defining 
appropriate hybridization conditions is within the skill of the art. See, e.g., 
Maniatis et ah, supra\ DNA Cloning, Vols. I & H, supra\ Nucleic Acid 
Hybridization, supra. 

An "antisense nucleic acid" is a sequence of nucleotides that is 
complementary to the sense sequence. Antisense nucleic acids can be used to 
down regulate or block the expression of the polypeptide encoded by the sense 
strand. The present invention is also intended to cover corresponding anti-sense 
nucleic acids whose expression makes it possible to control the transcription of 
cellular mRNAs. Such nucleic acids may comprise all or part of a nucleic acid 
polynucleotide sequence according to the invention, transcribed in the reverse 
orientation. They are complementary or significantly complementary to the 
polynucleotide sequence of a nucleic acid according to the invention, either in total 
or in part. 

Transcriptional and translational control sequences are DNA regulatory 
sequences, such as promoters, enhancers, terminators, and the like, that provide for 
the expression of a coding sequence in a host cell, hi eukaryotic cells, 
polyadenylation signals are additional types of control sequences. 

A "signal sequence" is included at the beginning of the coding sequence of 
a protein to be expressed on the surface of a cell. This sequence encodes a signal 
peptide, N-terminal to the mature polypeptide, that directs the host cell to 
translocate the polypeptide. The term "translocation signal sequence" is used 
herein to refer to this sort of signal sequence. Translocation signal sequences can 
be found associated with a variety of proteins native to eukaryotes and prokaryotes, 
and are often fimctional in both types of organisms. 
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"Regulatory region" means a nucleic acid sequence which regulates the 
expression of a second nucleic acid sequence. A regulatory region may include 
sequences which are naturally responsible for expressing a particular nucleic acid 
(a homologous region) or may include sequences of a different origin which are 
responsible for expressing different proteins or even synthetic proteins (a 
heterologous region). Li particular, the sequences can be sequences of eukaryotic 
or viral genes or derived sequences which stimulate or repress transcription of a 
gene in a specific or non-specific manner and in an inducible or non-inducible 
maimer. Regulatory regions include origins of replication, RNA splice sites, 
promoters, enhancers, transcriptional termination sequences, signal sequences 
which direct the polypeptide into the secretory pathways of the target cell, and 
promoters. 

A regulatory region from a "heterologous source" is a regulatory region 
which is not naturally associated with the expressed nucleic acid. Included among 
the heterologous regulatory regions are regulatory regions from a different species, 
regulatory regions from a different gene, hybrid regulatory sequences, and 
regulatory sequences which do not occur in nature, but which are designed by one 
having ordinary skill in the art. 

"Heterologous" DNA refers to DNA not naturally located in the cell, or in a 
chromosomal site of the cell. Preferably, the heterologous DNA includes a gene 
foreign to the cell. 

"Homologous recombination" refers to the insertion of a foreign DNA 
sequence into another DNA molecule, e.g., insertion of a vector in a chromosome. 
Preferably, the vector targets a specific chromosomal site for homologous 
recombination. For specific homologous recombination, the vector will contain 
sufficiently long regions of homology to sequences of the chromosome to allow 
complementary binding and incorporation of the vector into the chromosome. 
Longer regions of homology, and greater degrees of sequence similarity, may 
increase the efficiency of homologous recombination. 

A "polypeptide" is a polymeric compound comprised of covalently linked 
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amino acid residues. Amino acids have the following general structure: 

H 

1 

R_C-COOH 
1 

NH2 

Amino acids are classified into seven groups on the basis of the side chain R: (1) 
aliphatic side chains, (2) side chains containing a hydroxylic (OH) group, (3) side 
chains containing sulfur atoms, (4) side chains containing an acidic or amide 
group, (5) side chains containing a basic group, (6) side chains containing an 
aromatic ring, and (7) proline, an imino acid in which the side chain is fused to the 
amino group, A polypeptide of the invention preferably comprises at least about 
14 amino acids. 

A "protein" is a polypeptide which plays a structural or functional role in a 
living cell. 

A "variant" of a polypeptide or protein is any analogue, fragment, 
derivative, or mutant which is derived fi-om a polypeptide or protein and which 
retains at least one biological property of the polypeptide or protein. Different 
variants of the polypeptide or protein may exist in nature. These variants may be 
allelic variations characterized by differences in the nucleotide sequences of the 
structural gene coding for the protein, or may involve differential splicing or post- 
translational modification. The skilled artisan can produce variants having single 
or multiple amino acid substitutions, deletions, additions, or replacements. These 
variants may include, inter alia: (a) variants in which one or more amino acid 
residues are substituted with conservative or non-conservative amino acids, (b) 
variants in which one or more amino acids are added to the polypeptide or protein, 
(c) variants in which one or more of the amino acids includes a substituent group, 
and (d) variants in which the polypeptide or protein is fused with another 
polypeptide such as serum albumin. The techniques for obtaining these variants, 
including genetic (suppressions, deletions, mutations, etc.), chemical, and 
enzymatic techniques, are known to persons having ordinary skill in the art. A 
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variant of the invention preferably comprises at least about 14 amino acids. 

If such allelic variations, analogues, fragments, derivatives, mutants, and 
modifications, including alternative mRNA spUcing forms and alternative post- 
translational modification forms result in derivatives of the polypeptide which 
5 retain any of the biological properties of the polypeptide, they are intended to be 
included within the scope of this invention. 

A "heterologous protein" refers to a protein not naturally produced in the 

cell. 

Two amino acid sequences are "substantially homologous" or "substantially 
1 0 similar" when greater than about 40% of the amino acids are identical, or greater 
than 60% are similar (functionally identical). Preferably, the similar or 
homologous sequences are identified by aUgnment using, for example, the GCG 
(Genetics Computer Group, Program Manual for the GCG Package, Version 7, 
Madison, Wisconsin) pileup program. 
1 5 The term "corresponding to" is used herein to refer to similar or 

homologous sequences, whether the exact position is identical or different from the 
molecule to which the similarity or homology is measured. A nucleic acid or 
amino acid sequence alignment may include spaces. Thus, the term 
"corresponding to" refers to the sequence similarity, and not the numbering of the 
2 0 amino acid residues or nucleotide bases. 

A "derivative" is understood to mean, for the purposes of the present 
invention, the products comprising, for example, modifications at the level of the 
primary structure, such as deletions of one or more residues, substitutions of one or 
more residues, and/or modifications at the level of one or more residues. The 

2 5 number of residues affected by the modifications may be, for example, from 1 , 2 or 

3 to 10, 20, or 30 residues. The term derivative also comprises the molecules 
comprising additional intemal or terminal parts, of a peptide nature or otherwise. 
They may be in particular active parts, markers, amino acids, such as methionine at 
position -1. The term derivative also comprises the molecules comprising 

3 0 modifications at the level of the tertiary structure (N-terminal end, and the like). 
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The term derivative also comprises sequences homologous to the sequence 
considered, derived from other cellular sources, and in particular from cells of 
human origin, or from other organisms, and possessing activity of the same type or 
of substantially similar type. Such homologous sequences may be obtained by 
hybridization experiments. The hybridizations may be performed based on nucleic 
acid libraries, using, as probe, the native sequence or a fragment thereof, under 
conventional stringency conditions (Maniatis et al., General Molecular Biology 
Techniques), or preferably xmder high stringency conditions. 

These derivatives may be generated for various purposes, such as in 
particular that of increasing the affinity of the peptide for its site(s) of interaction, 
that of enhancing its production levels, or that of conferring new pharmacokinetic 
and/or biological properties on it. 

Preferably, the derivatives share at least one structural homology with a 
polypeptide according to the invention and more preferably at least at the level of 
the regions bordering the bHLH domain, and conserve its biological property of 
transcriptional activator. The derivatives may also offer new properties to the 
claimed polypeptides (labelling, and the like), or enhance their properties (stability, 
biological activity and the like). 

hi accordance with the present invention there may be employed 
conventional molecular biology, microbiology, and recombinant DNA techniques 
within the skill of the art. Such techniques are explained fiiUy in the literature. 
See, e.g., Sambrook, Fritsch & Maniatis, Molecular Cloning: A Laboratory 
Manual Second Edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, New York (herein "Sambrook et al., 1989"); DNA Cloning: A Practical 
Approach Volumes I and H (D.N. Glover ed. 1985); Oligonucleotide Synthesis 
(M.J. Gait ed. 1984); Nucleic Acid Hybridization [B.D. Hames & S J. Higgins eds. 
(1985)]; Transcription And Translation [B.D. Hames & S.J. Higgins, eds. (1984)]; 
Animal Cell Culture [R.L Freshney, ed. (1986)]; Immobilized Cells And Enzymes 
[IRL Press, (1986)]; B. Perbal, A Practical Guide To Molecular Cloning (1984); 
F.M. Ausubel et al. (eds.). Current Protocols in Molecular Biology, John Wiley & 
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Sons, Inc. (1994), 

A gene encoding a polypeptide according to the invention, whether 
genomic DNA or cDNA, can be isolated from any source, particularly from a 
human cDNA or genomic library. General methods for obtaining such a gene are 
well known in the art, as described above (see, e.g., Sambrook et al., 1989, supra). 

Accordingly, any animal cell potentially can serve as the nucleic acid 
source for the molecular cloning of a gene according to the invention. The DNA 
may be obtained by standard procedures known in the art from cloned DNA {e.g., a 
DNA "library"), and preferably is obtained from a cDNA library prepared from 
tissues with high level expression of the protein (e.g., heart, pancreas and skeletal 
muscle cDNA, since these are the cells that evidence high levels of expression of a 
polypeptide according to the invention), by chemical synthesis, by cDNA cloning, 
or by the cloning of genomic DNA, or fragments thereof, purified from the desired 
cell (See, for example, Sambrook et aL, 1989, supra; Glover, D.M. (ed.), 1985, 
DNA Cloning: A Practical Approach, MRL Press, Ltd., Oxford, U.K. Vol. I, H). 
Clones derived from genomic DNA may contain regulatory and intron DNA 
regions in addition to coding regions; clones derived from cDNA will not contain 
intron sequences. Whatever the source, the gene should be molecularly cloned into 
a suitable vector for propagation of the gene. 

Once the DNA fragments are generated, identification of the specific DNA 
fragment containing the desired gene of the invention maybe accomplished in a 
number of ways. For example, DNA fragments may be screened by nucleic acid 
hybridization to a labeled probe (Benton and Davis, 1977, Science 196:180; 
Grunstein and Hogness, 1975, Proc. Natl. Acad. Sci. U.S.A. 72:3961). Those DNA 
fragments with substantial homology to the probe will hybridize. As noted above, 
the greater the degree of homology, the more stringent hybridization conditions can 
be used. In a specific embodiment. Northern hybridization conditions are used to 
identify mRNA sphcing variants of a gene according to the invention. 

Further selection can be carried out on the basis of the properties of the 
gene, e.g., if the gene encodes a protein product having the isoelectric. 
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electrophoretic, amino acid composition, or partial amino acid sequence of a 
polypeptide according to the invention as disclosed herein. Thus, the presence of 
the gene may be detected by assays based on the physical, chemical, or 
immunological properties of its expressed product. For example, cDNA clones, or 
DNA clones which hybrid-select the proper mRNAs, can be selected which 
produce a protein that, e.g., has similar or identical electrophoretic migration, 
isoelectric focusing or non-equilibrium pH gel electrophoresis behavior, proteolytic 
digestion maps, or antigenic properties as known for a polypeptide according to the 
invention. In a specific embodiment, the expressed protein is recognized by a 
polyclonal antibody that is generated against an epitope specific for human ngn3 
peptide. 

The present invention also relates to genes (e.g., cDNAs) encoding allelic 
variants, spUcing variants, analogs, and derivatives of a polypeptide according to 
the invention, that have the same or homologous functional activity as a 
polypeptide according to the invention, and homologs thereof fi-om other species. 
The production and use of derivatives and analogs related to polypeptides of the 
invention are within the scope of the present invention, Li a specific embodiment, 
the derivative or analog is functionally active, i.e., capable of exhibiting one or 
more functional activities associated with a full-length, wild-type Relax or hngn3 
polypeptide of the invention. 

A derivative of a polypeptide according to the invention can be made by 
aUering encoding nucleic acid sequences by substitutions, additions or deletions 
that provide for functionally equivalent molecules. Preferably, derivatives are 
made that have enhanced or increased functional activity relative to native Relax or 
hngnS polypeptide. 

Due to the degeneracy of nucleotide coding sequences, other DNA 
sequences which encode substantially the same amino acid sequence as a Relax 
gene or an hngn3 gene, including an amino acid sequence that contains a single 
amino acid variant, may be used in the practice of the present invention. These 
include but are not limited to allelic genes, homologous genes firom other species, 
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and nucleotide sequences comprising all or portions of a gene according to the 
invention that is altered by the substitution of different codons that encode the 
same amino acid residue within the sequence, thus producing a silent change. 
Likewise, the derivatives of the invention include, but are not limited to, those 
5 containing, as a primary amino acid sequence, all or part of the amino acid 
sequence of a Relax polypeptide or an hngn3 polypeptide including altered 
sequences in which functionally equivalent amino acid residues are substituted for 
residues within the sequence resulting in a conservative amino acid substitution. 
For example, one or more amino acid residues within the sequence can be 

1 0 substituted by another amino acid of a similar polarity, which acts as a functional 
equivalent, resulting in a silent alteration. Substitutes for an amino acid within the 
sequence may be selected from other members of the class to which the amino acid 
belongs. For example, the nonpolar (hydrophobic) amino acids include alanine, 
leucine, isoleucine, valine, proline, phenylalanine, tryptophan and methionine. 

15 Amino acids containing aromatic ring structures are phenylalanine, tryptophan, and 
tyrosine. The polar neutral amino acids include glycine, serine, threonine, cysteine, 
tyrosine, asparagine, and glutamine. The positively charged (basic) amino acids 
include arginine, lysine and histidine. The negatively charged (acidic) amino acids 
include aspartic acid and glutamic acid. Such alterations will not be expected to 

2 0 affect apparent molecular weight as determined by polyacrylamide gel 
electrophoresis, or isoelectric point. 

Particularly preferred substitutions are: 

- Lys for Arg and vice versa such that a positive charge may be maintained; 

- Glu for Asp and vice versa such that a negative charge may be maintained; 

2 5 - Ser for Thr such that a free -OH can be maintained; and 

- Gin for Asn such that a free CONH2 can be maintained. 

Amino acid substitutions may also be introduced to substitute an amino 
acid with a particularly preferable property. For example, a Cys may be introduced 
a potential site for disulfide bridges with another Cys. A His may be introduced as 

3 0 a particularly "catalytic" site {i.e.. His can act as an acid or base and is the most 
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common amino acid in biochemical catalysis). Pro may be introduced because of 
its particularly planar structure, which induces (3 -turns in the protein's structure. 

The genes encoding derivatives and analogs of the invention can be 
produced by various methods known in the art. The manipulations that result in 
5 their production can occur at the gene or protein level. For example, a cloned gene 
sequence according to the invention can be modified by any of numerous strategies 
known in the art (Sambrook et al., 1989, supra). The sequence can be cleaved at 
appropriate sites with restriction endonuclease(s), followed by further enzymatic 
S modification if desired, isolated, and ligated in vitro. In the production of the gene 

fl; 1 0 encoding a derivative or analog of a polypeptide according to the invention, care 

t! should be taken to ensure that the modified gene remains within the same 

p translational reading frame as the wild-type gene, uninterrupted by translational 

stop signals, in the gene region where the desired activity is encoded. 

Additionally, the nucleic acid sequence encoding a polypeptide according 
i ^ 15 to the invention can be mutated in vitro or in vivo^ to create and/or destroy 

translation, initiation, and/or termination sequences, or to create variations in 
coding regions and/or form new restriction endonuclease sites or destroy 
preexisting ones, to facilitate further in vitro modification. Preferably, such 
mutations enhance the fimctional activity of the mutated gene product. Any 
2 0 technique for mutagenesis known in the art can be used, including but not hmited 
to, in vitro site-directed mutagenesis (Hutchinson, C, et al, 1978, J. BioL Chem, 
253:6551; ZoUer and Smith, 1984, DNA 3:479-488; Oliphant et al, 1986, Gene 
44:177; Hutchinson et al., 1986, Proc. Natl. Acad. Sci. U.S.A. 83:710), use of 
TAB® linkers (Pharmacia), etc. PCR techniques m-e preferred for site directed 

2 5 mutagenesis (see Higuchi, 1989, "Using PCR to Engineer DNA", in PCR 

Technology: Principles and Applications for DNA Amplification^ H. Erlich, ed., 
Stockton Press, Chapter 6, pp. 61-70). 

The identified and isolated gene can then be inserted into an appropriate 
cloning vector. A large number of vector-host systems known in the art may be 

3 0 used. Possible vectors include, but are not limited to, plasmids or modified 
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viruses, but the vector system must be compatible with the host cell used. 
Examples of vectors include, but are not limited to, Escherichia coli, 
bacteriophages such as lambda derivatives, or plasmids such as pBR322 
derivatives or pUC plasmid derivatives, e,g. , pGEX vectors, pmal-c, pFLAG, etc. 
The insertion into a cloning vector can, for example, be accomplished by ligating 
the DNA fragment into a cloning vector that has complementary cohesive termini. 
However, if the complementary restriction sites used to fragment the DNA are not 
present in the cloning vector, the ends of the DNA molecules may be enzymatically 
modified. Alternatively, any site desired may be produced by ligating nucleotide 
sequences (linkers) onto the DNA termini; these ligated linkers may comprise 
specific chemically synthesized oUgonucleotides encoding restriction endonuclease 
recognition sequences. Recombinant molecules can be introduced into host cells 
via transformation, transfection, infection, electroporation, etc., so that many 
copies of the gene sequence are generated. Preferably, the cloned gene is contained 
on a shuttle vector plasmid, which provides for expansion in a cloning cell, e,g., E. 
coli, and purification for subsequent insertion into an appropriate expression cell 
line, if such is desired. For example, a shuttle vector, which is a vector that can 
replicate in more than one type of organism, can be prepared for repHcation in both 
E. coll and Saccharomyces cerevisiae by linking sequences from an E. coli plasmid 
with sequences form the yeast 2\x, plasmid. 

Nucleic Acids Encoding Relax and hngn3 Polypeptides 
The present invention relates to nucleic acids encoding novel basic helix- 
loop-heUx polypeptides. Relax and hngn3. Thus, a first subject of the invention is 
a nucleic acid comprising a polynucleotide sequence of a) SEQ ID NO: 1, or of a 
complementary polynucleotide sequence, orb) SEQ ID NO: 9, or of a 
complementary polynucleotide sequence. 

The invention also relates to a nucleic acid comprising at least 8 
consecutive nucleotides of a polynucleotide sequence of a) SEQ ID NO: 1, or of 
a complementary polynucleotide sequence, or b) SEQ ID NO: 9, or of a 
complementary polynucleotide sequence. 
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Preferably, a nucleic acid according to the invention comprises at least 
10, 12, 15, 18, 20 to 25, 35, 40, 50, 70, 80, 100, 200, or 500 consecutive 
nucleotides of a polynucleotide sequence of a) SEQ ID NO: 1, or of a 
complementary polynucleotide sequence, or b) SEQ ID NO: 9, or of a 
complementary polynucleotide sequence. 

The invention also relates to a nucleic acid having at least 80% nucleotide 
identity with a nucleic acid comprising a polynucleotide sequence of a) 
SEQ ID NO: 1, or of a complementary polynucleotide sequence, orb) SEQ ID NO: 
9, or of a complementary polynucleotide sequence. 

The invention also relates to a nucleic acid having at least 85%, preferably 
90%, more preferably 95% and still more preferably 98% nucleotide identity with a 
nucleic acid comprising a polynucleotide sequence of a) SEQ ID NO: 1, or of a 
complementary polynucleotide sequence, or b) SEQ ID NO: 9, or of a 
complementary polynucleotide sequence. 

The invention also relates to a nucleic acid hybridizing, under high 
stringency conditions, with a polynucleotide sequence of a) SEQ ID NO: 1, or of 
a complementary polynucleotide sequence, or b) SEQ ID NO: 9, or of a 
complementary polynucleotide sequence. 

The invention also relates to nucleic acids, particularly cDNA molecules, 
which encode the full length rat Relax or human ngn3 proteins. The present 
invention also relates to a cDNA molecule that encodes the novel Ml length rat 
Relax or human ngn3 proteins. Thus, the invention relates to a nucleic acid 
comprising a polynucleotide sequence of a) SEQ ID NO: 1 , or of a complementary 
polynucleotide sequence, or b) SEQ ID NO: 9, or of a complementary 
polynucleotide sequence. 

The invention also relates to a nucleic acid comprising a polynucleotide 
sequence as depicted in a) SEQ ID NO: 1, or of a complementary polynucleotide 
sequence, orb) SEQ ID NO: 9, or of a complementary polynucleotide sequence. 

According to the invention, a nucleic acid comprising a polynucleotide 
sequence of SEQ ID NO: 1 encodes a full length rat Relax polypeptide of 214 
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amino acids comprising the amino acid sequence of SEQ ID NO: 8. 

According to the invention, a nucleic acid comprising a polynucleotide 
sequence of SEQ ID NO: 9 encodes a full length human hngnS polypeptide of 214 
amino acids comprising the amino acid sequence of SEQ ID NO: 10. 

Relax and hngn3 Polypeptides According to the hivention 
The present invention also relates to rat Relax or human ngn3 (hngn3) 

polypeptides according to the invention. 

Thus, the present invention relates to a nucleic acid that encodes a 

polypeptide comprising an amino acid sequence of either one of SEQ ID NOs: 8 or 

10. 

The invention also relates to a polypeptide comprising an amino acid 
sequence as depicted in either one of SEQ ID NOs: 8 or 10. 

The invention also relates to a polypeptide comprising an amino acid 
sequence having at least 80% amino acid identity with a polypeptide comprising 
an amino acid sequence of either one of SEQ ID NOs: 8 or 10. 

The invention also relates to a polypeptide having at least 85%, preferably 
90%, more preferably 95%> and still more preferably 98% amino acid identity with 
a polypeptide comprising an amino acid sequence of either one of SEQ ID NOs: 8 
or 10. 

Preferably, a polypeptide according to the invention will have a length of 
15, 18 or 20 to 25, 35, 40, 50, 70, 80, 100 or 200 consecutive amino acids of a 
polypeptide according to the invention, in particular a polypeptide comprising an 
amino acid sequence of either one of SEQ ID NOs: 8 or 10. 

Altematively, a polypeptide according to the invention will comprise a 
fragment having a length of 15, 18, 20, 25, 35, 40, 50, 100 or 200 consecutive 
amino acids of a polypeptide according to the invention, more particularly of a 
polypeptide comprising an amino acid sequence of either one of SEQ ID NOs: 8 
or 10. 
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Expression of Polypeptides of the Invention and Methods of Producing 
Polypeptides of the Livention 
The invention relates to a method for the production of a polypeptide 
comprising an amino acid sequence of either one of SEQ ID NOs: 8 or 10, or of 
5 a polypeptide or a variant thereof, wherein said method comprises the steps of: 

a) inserting a nucleic acid encoding said polypeptide into an appropriate 

vector; 

b) culturing, in an appropriate culture medium, a previously transformed 
host cell or transfecting a host cell with the recombinant vector of step a); 

10 c) recovering the conditioned culture medium or lysing the host cell, for 

example by sonication or by osmotic shock; 

d) separating and purifying said polypeptide from said culture medium or 
alternatively from the cell lysates obtained in step c); and 

e) where appropriate, characterizing the recombinant polypeptide produced. 
15 A specific embodiment of the invention relates to a method for producing a 

polypeptide comprising an amino acid sequence of either one of SEQ ID NOs: 8 or 
10. 

A polypeptide termed "homologous" to a polypeptide having an amino acid 
sequence comprising either one of SEQ ID NOs: 8 or 10 also forms part of the 
2 0 invention. Such a homologous polypeptide comprises an amino acid sequence 

possessing one or more substitutions of an amino acid by an equivalent amino acid, 
relative to either one of SEQ ID NOs: 8 or 10, respectively. 

The polypeptides according to the invention may be characterized by 
binding to an immunoaffinity chromatography column on which the antibodies 

2 5 directed against this polypeptide or against a fragment or a variant thereof have 

been previously immobilized. 

According to another aspect, a recombinant pol3T>eptide according to the 
invention may be purified by passing it over an appropriate series of 
chromatography columns, according to methods known to persons skilled in the art 

3 0 and described for example in Ausubel et al. (1989). 
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A polypeptide according to the invention may also be prepared by 
conventional chemical synthesis techniques either in homogeneous solution or in 
solid phase. By way of illustration, a polypeptide according to the invention may 
be prepared by the technique either in homogeneous solution described by Houben 
5 Weyl, 1974 (In: Meuthode der Organischen Chemie, E. Wunsch Ed., 15-1:15-11) or 
the solid phase synthesis technique described by Merrifield, 1965 (Nature, 
207(996):522-523 and Science, 150(693):178-185). 

An "equivalent amino acid" according to the present invention will be 
understood to mean for example replacement of a residue in the L form by a 

1 0 residue in the D form or the replacement of a glutamic acid (E) by a pyro-glutamic 
acid according to techniques well known to persons skilled in the art. By way of 
illustration, the synthesis of peptide containing at least one residue in the D form is 
described by Koch, 1977 (Biochem. Biophys. Res. Commun., 74:488-491). 
According to another aspect, two amino acids belonging to the same class, that is 

15 to say two uncharged polar, nonpolar, basic or acidic amino acids, are also 
considered as equivalent amino acids. 

Polypeptides comprising at least one nonpeptide bond such as a retro- 
inverse bond (NHCO), a carba bond (CH2CH2) or a ketomethylene bond (CO-CH2) 
also form part of the invention. 

2 0 Preferably, the polypeptides according to the invention comprising one or 

more additions, deletions, substitutions of at least one amino acid will retain their 
capacity to be recognized by antibodies directed against the nonmodified 
polypeptides. 

The nucleotide sequence coding for a polypeptide according to the 

2 5 invention, or antigenic fragment, derivative or analog thereof, or a functionally 

active derivative, including a chimeric protein, thereof, can be inserted into an 
appropriate expression vector, z.e., a vector that contains the necessary elements for 
the transcription and translation of the inserted protein-coding sequence. Such 
elements are termed herein a "promoter." Thus, the nucleic acid of the invention is 

3 0 operationally associated with a promoter in an expression vector of the invention. 
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Both cDNA and genomic sequences can be cloned and expressed under control of 
such regulatory sequences. An expression vector also preferably includes a 
replication origin. 

The necessary transcriptional and translational signals can be provided on a 
5 recombinant expression vector, or they may be supplied by the native gene 
encoding a polypeptide according to the invention and/or its flanking regions. 
Potential host-vector systems include but are not hmited to mammalian cell 
systems infected with virus {e,g,, vaccinia virus, adenovirus, etc.); insect cell 
systems infected with virus {e,g., baculovirus); microorganisms such as yeast 

1 0 containing yeast vectors; or bacteria transformed with bacteriophage, DNA, 

plasmid DNA, or cosmid DNA. The expression elements of vectors vary in their 
strengths and specificities. Depending on the host-vector system utilized, any one 
of a number of suitable transcription and translation elements may be used. 
A recombinant polypeptide of the invention, or functional fragment, 

15 derivative, chimeric construct, or analog thereof, may be expressed 

chromosomally, after integration of the coding sequence by recombination. In this 
regard, any of a number of amplification systems may be used to achieve high 
levels of stable gene expression (See Sambrook et al., 1989, supra). 

The cell into which the recombinant vector comprising the nucleic acid 

2 0 encoding a polypeptide according to the invention is cultured in an appropriate cell 
culture medium under conditions that provide for expression of the polypeptide by 
the cell. Any of the methods previously described for the insertion of DNA 
fragments into a cloning vector may be used to construct expression vectors 
containing a gene consisting of appropriate transcriptional/translational control 

2 5 signals and the protein coding sequences. These methods may include in vitro 

recombinant DNA and synthetic techniques and in vivo recombination (genetic 
recombination) . 

A nucleic acid encoding a polypeptide according to the invention may be 
operably linked and controlled by any regulatory region, /.e, promoter/enhancer 

3 0 element known in the art, but these regulatory elements must be functional in the 
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host target tumor selected for expression. The regulatory regions may comprise a 
promoter region for functional transcription in the host cell, as well as a region 
situated 3' of the gene of interest, and which specifies a signal for termination of 
transcription and a polyadenylation site. All these elements constitute an 
5 expression cassette. 

Promoters that may be used in the present invention include both 
constitutive promoters and regulated (inducible) promoters. The promoter may be 
naturally responsible for the expression of the nucleic acid. It may also be from a 
heterologous source. In particular, it may be promoter sequences of eukaryotic or 

1 0 viral genes. For example, it may be promoter sequences derived fi-om the genome 
of the cell that it is desired to infect. Likewise, it may be promoter sequences 
derived from the genome of a virus, such as adenovirus (El A and MLP), 
cytomegalovirus (CMV), or Rous Sarcoma Virus (RSV). In addition, the promoter 
may be modified by addition of activating or regulatory sequences or sequences 

15 allowing a tissue-specific or predominant expression (enolase and GFAP promoters 
and the like). Moreover, when the nucleic acid does not contain promoter 
sequences, it may be inserted. 

Some promoters useful for practice of this invention are ubiquitous 
promoters (e.g.^ HPRT, vimentin, actin, tubulin), intermediate filament promoters 

2 0 (e.g., desmin, neurofilaments, keratin, GFAP), therapeutic gene promoters (e.g., 
MDR type, CFTR, factor VIH), tissue-specific promoters (e.g. , actin promoter in 
smooth muscle cells), promoters which are preferentially activated in dividing 
cells, promoters which respond to a stimulus (e.g., steroid hormone receptor, 
retinoic acid receptor), tetracycline-regulated transcriptional modulators, 

2 5 cytomegalovirus (CMV) immediate-early, retroviral LTR, metallothionein, S V-40, 

adenovirus Ela, and adenovirus major late (MLP) promoters. Tetracycline- 
regulated transcriptional modulators and CMV promoters are described in WO 
96/01313, US 5,168,062 and 5,385,839, the contents of which are incorporated 
herein by reference. 

3 0 More specifically, expression of a polypeptide according to the invention 



ST96042A US 



36 



may be controlled by any promoter/enhancer element known in the art, but these 
regulatory elements must be functional in the host selected for expression. 
Promoters which may be used to control gene expression include, but are not 
limited to, the SV40 early promoter region (Benoist and Chambon, 1981, Nature 
290:304-310), the promoter contained in the 3' long terminal repeat of Rous 
sarcoma virus (Yamamoto, et al., 1980, Cell 22:787-797), the herpes thymidine 
kinase promoter (Wagner et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441- 
1445), the regulatory sequences of the metallothionein gene (Brinster et al., 1982, 
Nature 296:39-42); prokaryotic expression vectors such as the p-lactamase 
promoter (Villa-Kamaroff, et al., 1978, Proc. Natl. Acad. Sci. U.S.A. 75:3727- 
3731), or the tac promoter (DeBoer, et al., 1983, Proc. Natl. Acad. Sci. U.S.A. 
80:21-25); see also "Useful proteins jfrom recombinant bacteria" in Scientific 
American, 1980, 242:74-94; promoter elements from yeast or other fiingi such as 
the Gal 4 promoter, the ADC (alcohol dehydrogenase) promoter, PGK 
(phosphoglycerol kinase) promoter, alkaline phosphatase promoter; and the animal 
transcriptional control regions, which exhibit tissue specificity and have been 
utiUzed in transgenic animals: elastase I gene control region which is active in 
pancreatic acinar cells (Swift et al., 1984, Cell 38:639-646; Omitz et al., 1986, 
Cold Spring Harbor Symp. Quant. Biol. 50:399-409; MacDonald, 1987, 
Hepatology 7:425-515); insulin gene control region which is active in pancreatic 
beta cells (Hanahan, 1985, Nature 315:115-122), immunoglobulin gene control 
region which is active in lymphoid cells (Grosschedl et al., 1984, Cell 38:647-658; 
Adames et al., 1985, Nature 318:533-538; Alexander et al., 1987, Mol. Cell. Biol. 
7:1436-1444), mouse mammary tumor virus control region which is active in 
testicular, breast, lymphoid and mast cells (Leder et al., 1986, Cell 45:485-495), 
albumin gene control region which is active in liver (Pinkert et al., 1987, Genes 
and Devel. 1:268-276), alpha-fetoprotein gene control region which is active in 
liver (Krumlauf et al., 1985, Mol. Cell. Biol. 5:1639-1648; Hammer et al., 1987, 
Science 235:53-58), alpha 1-antitrypsin gene control region which is active in the 
liver (Kelsey et al., 1987, Genes and Devel. 1:161-171), beta-globin gene control 
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region which is active in myeloid cells (Mogram et al., 1985, Nature 315:338-340; 
KoUias et al., 1986, Cell 46:89-94), myelin basic protein gene control region which 
is active in oligodendrocyte cells in the brain (Readhead et al., 1987, Cell 48:703- 
712), myosin light chain-2 gene control region which is active in skeletal muscle 
5 (Sani, 1985, Nature 314:283-286), and gonadotropic releasing hormone gene 
control region which is active in the hypothalamus (Mason et aL, 1986, Science 
234:1372-1378). 

Expression vectors containing a nucleic acid encoding a polypeptide of the 
invention can be identified by five general approaches: (a) PCR amplification of 

1 0 the desired plasmid DNA or specific mRNA, (b) nucleic acid hybridization, (c) 
presence or absence of selection marker gene fimctions, (d) analyses with 
appropriate restriction endonucleases, and (e) expression of inserted sequences. In 
the first approach, the nucleic acids can be amplified by PCR to provide for 
detection of the amplified product. In the second approach, the presence of a 

1 5 foreign gene inserted in an expression vector can be detected by nucleic acid 
hybridization using probes comprising sequences that are homologous to an 
inserted marker gene. In the third approach, the recombinant vector/host system 
can be identified and selected based upon the presence or absence of certain 
"selection marker" gene fimctions (e,g,, p-galactosidase activity, thymidine kinase 

2 0 activity, resistance to antibiotics, transformation phenotype, occlusion body 
formation in baculovirus, etc.) caused by the insertion of foreign genes in the 
vector. In another example, if the nucleic acid encoding a polypeptide according to 
the invention is inserted within the "selection marker" gene sequence of the vector, 
recombinants containing the nucleic acid insert can be identified by the absence of 

2 5 the gene fimction. In the foxrrth approach, recombinant expression vectors are 

identified by digestion with appropriate restriction enzymes. In the fifth approach, 
recombinant expression vectors can be identified by assaying for the activity, 
biochemical, or immunological characteristics of the gene product expressed by the 
recombinant, provided that the expressed protein assumes a fimctionally active 

3 0 conformation. 
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A wide variety of host/expression vector combinations may be employed in 
expressing the DNA sequences of this invention. Useful expression vectors, for 
example, may consist of segments of chromosomal, non-chromosomal and 
synthetic DNA sequences. Suitable vectors include derivatives of SV40 and 
known bacterial plasmids, e,g., E, coli plasmids col El, pCRl, pBR322, pMal-C2, 
pET, pGEX (Smith et al, 1988, Gene 67:31-40), pMB9 and their derivatives, 
plasmids such as RP4; phage DNAS, e.g., the numerous derivatives of phage 1, 
e.g., NM989, and other phage DNA, e,g,. Ml 3 and filamentous single stranded 
phage DNA; yeast plasmids such as the 2m plasmid or derivatives thereof; vectors 
useful in eukaryotic cells, such as vectors useful in insect or mammaUan cells; 
vectors derived from combinations of plasmids and phage DNAs, such as plasmids 
that have been modified to employ phage DNA or other expression control 
sequences; and the like. 

For example, in a baculovirus expression systems, both non-fusion transfer 
vectors, such as but not limited to pVL941 (5amHl cloning site; Summers), 
pVL1393 {Bamm, Smal, Xba\ EcoKl, Notl, XmaJE, BglR, and PstI cloning site; 
hivitrogen), pVL1392 (BglR, Pstl, Not\ XmaJR, EcoRL, Xbal, Smal, and Bamm 
cloning site; Summers and hivitrogen), and pBlue^acIH (BamHl, BglR, Pstl, Ncol, 
and Hindm cloning site, with blue/white recombinant screening possible; 
Invitrogen), and fusion transfer vectors, such as but not limited to pAcVOO (BamRl 
and Kpnl cloning site, in which the BamUl recognition site begins with the 
initiation codon; Summers), pAc701 and pAc702 (same as pAc700, with different 
reading frames), pAc360 (BamHl cloning site 36 base pairs downstream of a 
polyhedrin initiation codon; Invitrogen(195)), and pBlueBacHisA, B, C [three 
different reading frames, with BamHl, BglR, Pstl, Ncol, and Hindm cloning site, 
an N-terminal peptide for ProBond purification, and blue/white recombinant 
screening of plaques; Invitrogen (220)] can be used. 

Mammalian expression vectors contemplated for use in the invention 
include vectors with inducible promoters, such as the dihydrofolate reductase 
(DHFR) promoter, e.g., any expression vector with a DHFR expression vector, or a 
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DHFR/mcthotTQxatc co-amplification vector, such as pED (PstI, Sail, Sbal, Smal, 
and EcoKl cloning site, with the vector expressing both the cloned gene and 
DHFR; see Kaufinan, Current Protocols in Molecular Biology^ 16.12 (1991). 
Alternatively, a glutamine synthetase/methionine sulfoximine co-amplification 
5 vector, such as pEE14 (HindUl, Xbal, Smal, Sbal, EcoWL^ and Bcli cloning site, in 
which the vector expresses glutamine synthase and the cloned gene; Celltech). hi 
another embodiment, a vector that directs episomal expression under control of 
Epstein Barr Virus (EBV) can be used, such as pREP4 {BamHl, Sfll, Xhol, Noil, 
Nhel, HindSl, Nhe\ PvwII, and Kpn\ cloning site, constitutive Rous Sarcoma Virus 

1 0 Long Terminal Repeat (RS V-LTR) promoter, hygromycin selectable marker; 

hivitrogen), pCEP4 (BamKl, Sfll, Xhol, Notl, Nhel, HindBl, Nhel, PvuU, and Kpnl 
cloning site, constitutive human cytomegalovirus (hCMV) immediate early gene, 
hygromycin selectable marker; Invitrogen), pMEP4 {Kpnl, Pvul, Nhel, HindEl, 
Noil, Xhol, Sfil, BamRl cloning site, inducible methallothionein Ha gene promoter, 

1 5 hygromycin selectable marker: Invitrogen), pREPS (BamB.1, Xhol, Notl, HindlR, 
Nhel, and Kpnl cloning site, RS V-LTR promoter, histidinol selectable marker; 
Invitrogen), pREP9 (Kpnl, Nhel, HindUl, Notl, Xhol, Sfil, and BamHI cloning site, 
RSV-LTR promoter, G418 selectable marker; hivitrogen), and pEBVHis (RSV- 
LTR promoter, hygromycin selectable marker, N-terminal peptide purifiable via 

2 0 ProBond resin and cleaved by enterokinase; hivitrogen). Selectable mammalian 
expression vectors for use in the invention include pRc/CMV {HindOL, BstXl, Notl, 
Sbal, and^pal cloning site, G418 selection; Invitrogen), pRc/RSV (HindlR, Spel, 
BstXl, Notl, Xbal cloning site, G418 selection; Invitrogen), and others. Vaccinia 
virus mammalian expression vectors (see, Kaufman, 1991, supra) for use 

2 5 according to the invention include but are not limited to pSCl 1 (Smal cloning site, 

TK- and p-gal selection), pMJ601 (Sail, Smal, AfR, Narl, BspMR, BamEI, Apal, 
Nhel, SacH, Kpnl, and HindUl cloning site; TK- and (3-gal selection), and 
pTKgptFlS (EcoRl, Pstl, Sail, Accl, HindR, Sbal, BamHI, and Hpa cloning site, 
TK or XPRT selection). 

3 0 Yeast expression systems can also be used according to the invention to 
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express a polypeptide according to the invention. For example, the non-fusion 
pYES2 vector (Xbal, Sphl, Shol, Notl, GstXl, EcoRl, BstXI, BamRl, Sad, Kpn\, 
and Hindni cloning sit; Invitrogen) or the fusion pYESHisA, B, C (Mai, Sphl, 
Shol, Notl, BstXl, EcoRl, BamHl, Sad, Kpnl, and HindlE cloning site, N-terminal 
5 peptide purified with ProBond resin and cleaved with enterokinase; Invitrogen), to 
mention just two, can be employed according to the invention. 

Once a particular recombinant DNA molecule is identified and isolated, 
several methods known in the art may be used to propagate it. Once a suitable host 
system and growth conditions are established, recombinant expression vectors can 

10 be propagated and prepared in quantity. As previously explained, the expression 
vectors which can be used include, but are not limited to, the following vectors or 
their derivatives: human or animal viruses such as vaccinia virus or adenovirus; 
insect viruses such as baculovirus; yeast vectors; bacteriophage vectors (e.g., 
lambda), and plasmid and cosmid DNA vectors, to name but a few. 

15 hi addition, a host cell strain may be chosen which modulates the 

expression of the inserted sequences, or modifies and processes the gene product in 
the specific fashion desired. Different host cells have characteristic and specific 
mechanisms for the translational and post-translational processing and 
modification of proteins. Appropriate cell lines or host systems can be chosen to 

2 0 ensure the desired modification and processing of the foreign protein expressed. 
Expression in yeast can produce a biologically active product. Expression in 
eukaryotic cells can increase the likelihood of "native" folding. Moreover, 
expression in mammalian cells can provide a tool for reconstituting, or 
constituting, activity of a polypeptide according to the invention. Furthermore, 

2 5 different vector/host expression systems may affect processing reactions, such as 

proteolytic cleavages, to a different extent. 

Vectors are introduced into the desired host cells by methods known in the 
art, e,g,, transfection, electroporation, microinjection, transduction, cell fusion, 
DEAE dextran, calcium phosphate precipitation, lipofection (lysosome fusion), use 

3 0 of a gene gun, or a DNA vector transporter (see, e.g., Wu et al., 1992, J. Biol. 
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Chem. 267:963-967; Wu and Wu, 1988, J. Biol. Chem. 263:14621-14624; Hartmut 
et al., Canadian Patent Application No. 2,012,3 1 1 , filed March 15,1 990). 

Soluble forms of the protein can be obtained by collecting culture fluid, or 
solubilizing inclusion bodies, e.g,, by treatment with detergent, and if desired 
sonication or other mechanical processes, as described above. The solubiUzed or 
soluble protein can be isolated using various techniques, such as polyacrylamide 
gel electrophoresis (PAGE), isoelectric focusing, 2-dimensional gel 
electrophoresis, chromatography {e.g., ion exchange, affinity, immunoaffmity, and 
sizing column chromatography), centrifugation, differential solubility, 
immunoprecipitation, or by any other standard technique for the purification of 
proteins. 

Antibodies to Polypeptides of the LiventionError! Bookmark not defined. 
A polypeptide according to the invention produced recombinantly or by 
chemical synthesis, and fragments or other derivatives or analogs thereof, including 
fusion proteins, may be used as an antigen or immunogen to generate antibodies. 
Preferably, the antibodies specifically bind a Relax or ngn3 polypeptide according 
to the invention. More preferably, the antibodies specifically bind human ngn3, 
but do not bind other forms of ngn3. 

A molecule is "antigenic" v^hen it is capable of specifically interacting with 
an antigen recognition molecule of the immime system, such as an 
immunoglobuhn (antibody) or T cell antigen receptor. An antigenic polypeptide 
contains at least about 5, and preferably at least about 10, amino acids. An 
antigenic portion of a molecule can be that portion that is immunodominant for 
antibody or T cell receptor recognition, or it can be a portion used to generate an 
antibody to the molecule by conjugating the antigenic portion to a carrier molecule 
for immunization. A molecule that is antigenic need not be itself immunogenic, 
i.e., capable of eUciting an immime response without a carrier. 

Such antibodies include but are not limited to polyclonal, monoclonal, 
chimeric, single chain, Fab fragments, and an Fab expression library. The 
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antibodies of the invention may be cross reactive, e.g., they may recognize Relax or 
ngn3 from different species. Polyclonal antibodies have greater likelihood of cross 
reactivity. Alternatively, an antibody of the invention may be specific for a single 
form of ngn3, such as human ngn3. Preferably, such an antibody is specific for 
5 human ngnS. 

Various procedures known in the art may be used for the production of 
polyclonal antibodies. For the production of antibody, various host animals can be 
immunized by injection with a polypeptide according to the invention, or a 
derivative {e.g., fragment or ftision protein) thereof, including but not limited to 

1 0 rabbits, mice, rats, sheep, goats, etc. In one embodiment, the polypeptide 
according to the invention or fragment thereof can be conjugated to an 
immunogenic carrier, e.g., bovine serum albumin (BSA) or keyhole limpet 
hemocyanin (KLH). Various adjuvants may be used to increase the immunological 
response, depending on the host species, including but not limited to Freund's 

15 (complete and incomplete), mineral gels such as aluminxmi hydroxide, surface 
active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil 
emulsions, keyhole limpet hemocyanins, dinitrophenol, and potentially useful 
human adjuvants such as BCG (bacille Calmette-Guerin) and Corynebacterium 
parvum. 

2 0 For preparation of monoclonal antibodies directed toward a polypeptide 

according to the invention, or fragment, analog, or derivative thereof, any 
technique that provides for the production of antibody molecules by continuous 
cell lines in culture may be used. These include but are not limited to the 
hybridoma technique originally developed by Kohler and Milstein [Nature 

2 5 256:495-497 (1975)], as well as the trioma technique, the human B-cell hybridoma 

technique [Kozbor et al.. Immunology Today 4:72 1983); Cote et ah, Proc, Natl. 
Acad. ScL U.S.A. 80:2026-2030 (1983)], and the EBV-hybridoma technique to 
produce human monoclonal antibodies [Cole et al, in Monoclonal Antibodies and 
Cancer Therapy, Alan R. Liss, hic, pp. 77-96 (1985)]. hi an additional 

3 0 embodiment of the invention, monoclonal antibodies can be produced in germ-free 
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animals [Memational Patent Publication No. WO 89/12690, published 28 
December 1989]. In fact, according to the invention, techniques developed for the 
production of "chimeric antibodies" [Morrison et al., J. Bacteriol 159:870 (1984); 
Neuberger et aL, Nature 312:604-608 (1984); Takeda et al.. Nature 314:452-454 
(1985)] by spUcing the genes from a mouse antibody molecule specific for a 
polypeptide according to the invention together with genes from a human antibody 
molecule of appropriate biological activity can be used; such antibodies are within 
the scope of this invention. Such human or humanized chimeric antibodies are 
preferred for use in therapy of human diseases or disorders (described infra\ since 
the human or humanized antibodies are much less likely than xenogenic antibodies 
to induce an immune response, in particular an allergic response, themselves. 

According to the invention, techniques described for the production of 
single chain Fv (scFv) antibodies [U.S. Patent Nos. 5,476,786 and 5,132,405 to 
Huston; U.S. Patent 4,946,778] can be adapted to produce polypeptide-specific 
single chain antibodies. An additional embodiment of the invention utilizes the 
techniques described for the construction of Fab expression libraries [Huse et aL, 
Science 246:1275-1281 (1989)] to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity for a polypeptide according 
to the invention, or its derivatives, or analogs. 

Antibody fragments that contain the idiotype of the antibody molecule can 
be generated by known techniques. For example, such fragments include but are 
not limited to: the F(ab')2 fragment which can be produced by pepsin digestion of 
the antibody molecule; the Fab' fragments which can be generated by reducing the 
disulfide bridges of the F(ab')2 fragment, and the Fab fragments which can be 
generated by treating the antibody molecule with papain and a reducing agent. 

In the production of antibodies, screening for the desired antibody can be 
accomplished by techniques known in the art, e.g., radioimmunoassay, ELISA 
(enzyme-linked immimosorbent assay), "sandwich" immunoassays, 
immunoradiometric assays, gel diffusion precipitin reactions, immunodifftision 
assays, in situ immunoassays (using colloidal gold, enzyme or radioisotope labels, 
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for example), western blots, precipitation reactions, agglutination assays (e.g., gel 
agglutination assays, hemagglutination assays), complement fixation assays, 
inamunofluorescence assays, protein A assays, and immunoelectrophoresis assays, 
etc. In one embodiment, antibody binding is detected by detecting a label on the 
primary antibody. In another embodiment, the primary antibody is detected by 
detecting binding of a secondary antibody or reagent to the primary antibody. In a 
further embodiment, the secondary antibody is labeled. Many means are known in 
the art for detecting binding in an immunoassay and are within the scope of the 
present invention. For example, to select antibodies that recognize a specific 
epitope of a polypeptide according to the invention, one may assay generated 
hybridomas for a product that binds to a polypeptide fi-agment conmprising such an 
epitope. For selection of an antibody specific to a polypeptide of the invention 
from a particular species of animal, one can select on the basis of positive binding 
with the polypeptide expressed by or isolated fi*om cells of that species of animal. 

The foregoing antibodies can be used in methods known in the art relating 
to the localization and activity of a polypeptide of the invention, e.g., for Westem 
blotting, imaging the polypeptide in situ, measuring levels thereof in appropriate 
physiological samples, etc, using any of the detection techniques mentioned above 
or known in the art. 

In a specific embodiment, antibodies that agonize or antagonize the activity 
of a Relax or hngnS polypeptide can be generated. Such antibodies can be tested 
using the assays for identifying Ugands. hi particular, such antibodies can be scFv 
antibodies expressed intracellularly. 

The present invention relates to an antibody directed against a polypeptide 
comprising an amino acid sequence of 

1) either one of SEQ ID NOs: 8 or 10, 

2) a polypeptide fi-agment or variant of a polypeptide comprising an amino 
acid sequence of either one of SEQ ID NOs: 8 or 10, or 

3) a polypeptide termed "homologous" to a polypeptide comprising either 
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one of SEQ ID NOs: 8 or 10, 

as produced in the trioma technique or the hybridoma technique described by 
Kozbor et al also forms part of the invention. 

The invention also relates to single-chain Fv antibody fragments (ScFv) as 
described in US patent No. 4,946,778 or by Martineau et al (J Mol Biol, 
280(1):1 17-127, 1998). 

The antibodies according to the invention also comprise antibody fragments 
obtained with the aid of phage libraries (Martineau et al. Ibid.) or humanized 
antibodies [Reimann et al (AIDS Res Hum Retroviruses, 13(1 1):933-943, 1997) 
and Leger et al (Hum Antibodies, 8(1):3-16, 1997)]. 

The antibody preparations according to the invention are usefiil in 
immunological detection tests intended for the identification of the presence and/or 
of the quantity of antigens present in a sample. 

An antibody according to the invention may comprise, in addition, a 
detectable marker that is isotopic or nonisotopic, for example fluorescent, or may 
be coupled to a molecule such as biotin, according to techniques well known to 
persons skilled in the art. 

Thus, another subject of the invention is a method of detecting the presence 
of a polypeptide according to the invention in a sample, said method comprising 
the steps of: 

a) bringing the sample to be tested into contact with an antibody directed 

against 

1) a polypeptide comprising an amino acid sequence of either one of 
SEQ ID NOs: 8 or 10, 

2) a polypeptide fragment or variant of a polypeptide comprising an 
amino acid sequence of either one of SEQ ID NOs: 8 or 10, or 

3) a polypeptide termed "homologous" to a polypeptide comprising 
either one of SEQ ID NOs: 8 or 10, and 

b) detecting the antigen/antibody complex formed. 

The invention also relates to a box or kit for diagnosis or for detecting the 
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presence of a polypeptide in accordance with the invention in a sample, said box 
comprising: 

a) an antibody directed against 

1) a polypeptide comprising an amino acid sequence of either one of 
SEQIDNOs:8or 10, 

2) a polypeptide fragment or variant of a polypeptide comprising an 
amino acid sequence of either one of SEQ ID NOs: 8 or 10, or 

3) a polypeptide termed "homologous" to a polypeptide comprising 
either one of SEQ ID NOs: 8 or 10, 

and 

b) a reagent allowing the detection of the antigen/antibody complex 

formed. 

Gene Therapy and Transgenic Vectors 
The invention also relates to a recombinant vector comprising a nucleic 

acid according to the invention. "Vector" for the purposes of the present invention 

will be understood to mean a circular or linear DNA or RNA molecule that is either 

in single-stranded or double-stranded form. 

Preferably, such a recombinant vector will comprise a nucleic acid selected 

from the group consisting of 

a) a nucleic acid comprising a polynucleotide sequence of either one of 
SEQ ID NOs: 1 or 9, or of a complementary polynucleotide sequence, 

b) a nucleic acid comprising a polynucleotide sequence as depicted in either 
one of SEQ ID NOs: 1 or 9, or of a complementary polynucleotide sequence, 

c) a nucleic acid having at least eight consecutive nucleotides of a nucleic 
acid comprising a polynucleotide sequence of 1) SEQ ID NO: 1, or of a 
complementary polynucleotide sequence, or 2) SEQ ID NO: 9, or of a 
complementary polynucleotide sequence, 

d) a nucleic acid having at least 80% nucleotide identity with a nucleic acid 
comprising a polynucleotide sequence of either one of SEQ ID NOs: 1 or 9, or of a 
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complementary polynucleotide sequence, 

e) a nucleic acid having 85%, 90%, 95%, or 98% nucleotide identity with a 
nucleic acid comprising a polynucleotide sequence of either one of SEQ ED NOs: 1 
or 9, or of a complementary polynucleotide sequence, 

f) a nucleic acid hybridizing, under high stringency hybridization 
conditions, with a nucleic acid comprising a polynucleotide sequence of either one 
of SEQ ID NOs: 1 or 9, or of a complementary polynucleotide sequence, and 

g) a nucleic acid encoding a polypeptide comprising an amino acid 
sequence of either one of SEQ ID NOs: 8 or 10. 

According to a first embodiment, a recombinant vector according to the 
invention is used to ampUfy a nucleic acid inserted therein, following 
transformation or transfection of a desired cellular host. 

According to a second embodiment, a recombinant vector according to the 
invention corresponds to an expression vector comprising, in addition to a nucleic 
acid in accordance with the invention, a regulatory signal or nucleotide sequence 
that directs or controls transcription and/or translation of the nucleic acid and its 
encoded mRNA. 

According to a preferred embodiment, a recombinant vector according to 
the invention will comprise in particular the following components: 

(1) an element or signal for regulating the expression of the nucleic acid to 
be inserted, such as a promoter and/or enhancer sequence; 

(2) a nucleotide coding region comprised within the nucleic acid in 
accordance with the invention to be inserted into such a vector, said coding region 
being placed in phase with the regulatory element or signal described in (1); and 

(3) an appropriate nucleic acid for initiation and termination of transcription 
of the nucleotide coding region of the nucleic acid described in (2). 

In addition, the recombinant vectors according to the invention may include 
one or more origins for replication in the cellular hosts in which their amplification 
or their expression is sought, markers or selectable markers. 

The present invention also includes gene therapy by the administration to a 
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patient of a nucleic acid encoding a polypeptide according to the invention. 
Preferably, the nucleic acid encodes a human ngn3 protein. 

The nucleic acids of the invention, where appropriate incorporated in 
vectors, and the pharmaceutical compositions comprising them, may be used for 
the treatment of many pathologies or dysfunction. They may be used for the 
transfer and expression of genes in vivo in any type of tissue, especially the nervous 
system. The treatment can, moreover, be targeted in accordance with the pathology 
or dysfunction to be treated (transfer to a particular tissue can, in particular, be 
determined by the choice of a vector, and expression by the choice of a particular 
promoter). The nucleic acids or vectors of the invention are advantageously used 
for the production in humans or animals, in vivo and intracellularly, of proteins 
capable of acting specifically on various cell functions such as cell proliferation, 
cell differentiation, neurogenesis, and the like. The present invention thus makes it 
possible to treat specifically, locally and effectively cell dysfunctions at the origin 
of or resulting from different pathologies affecting neurogenesis. 

As discussed above, a "vector" is any means for the transfer of a nucleic 
acid according to the invention into a host cell. Preferred vectors are viral vectors, 
such as retroviruses, herpes viruses, adenoviruses, and adeno-associated viruses. 
Thus, a gene encoding a polypeptide according to the invention or a fragment 
thereof is introduced in vivo, ex vivo, or in vitro using a viral vector or through 
direct introduction of DNA. Expression in targeted tissues can be effected by 
targeting the transgenic vector to specific cells, such as with a viral vector or a 
receptor ligand, or by using a tissue-specific promoter, or both. 

Expression vectors of the invention can be used, as pointed out above, both 
to transfect cells for screening or biological testmg of modulators of the 
polypeptides according to the invention, or for delivery of a gene or antisense gene 
according to the invention in vivo or ex vivo for gene therapy, e.g., to increase or 
decrease the level of activity of a polypeptide of the invention. A vector that 
expresses a scFv of the invention can also be introduced using the techniques 
discussed below. 
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Viral vectors commonly used for in vivo or ex vivo targeting and therapy 
procedures are DNA-based vectors and retroviral vectors. Methods for 
constructing and using viral vectors are known in the art [see, e.g.. Miller and 
Rosman, BioTechniques 7:980-990 (1992)]. Preferably, the viral vectors are 
replication defective, that is, they are unable to rephcate autonomously in the target 
cell. In general, the genome of the replication defective viral vectors which are 
used within the scope of the present invention lack at least one region which is 
necessary for the repHcation of the virus in the infected cell. These regions can 
either be eliminated (in whole or in part), be rendered non-functional by any 
technique known to a person skilled in the art. These techniques include the total 
removal, substitution (by other sequences, in particular by the inserted nucleic 
acid), partial deletion or addition of one or more bases to an essential (for 
rephcation) region. Such techniques may be performed in vitro (on the isolated 
DNA) or in situ, using the techniques of genetic manipulation or by treatment with 
mutagenic agents. Preferably, the rephcation defective virus retains the sequences 
of its genome that are necessary for encapsulating the viral particles. 

DNA viral vectors include an attenuated or defective DNA virus, such as 
but not limited to herpes simplex virus (HSV), papillomavirus, Epstein Barr virus 
(EBV), adenovirus, adeno-associated virus (AAV), vaccinia virus, and the like. 
Vectors derived from adenoviruses, retroviruses or AAVs incorporating 
heterologous nucleic acid sequences have been described in the literature [Akli et 
al.. Nature Genetics 3 (1993) 224; Stratford-Perricaudet et al.. Human Gene 
Therapy 1 (1990) 241; EP 185 573, Levrero et al.. Gene 101 (1991) 195; Le Gal La 
Salle et al., Science 259 (1993) 988; Roemer and Friedmarm, Eur. J. Biochem. 208 
(1992) 211; Dobson et al., Neuron 5 (1990) 353; Chiocca et aL, New Biol. 2 (1990) 
739; Miyanohara et al.. New Biol. 4 (1992) 238; WO91/18088]. 

The present invention therefore also relates to any recombinant virus 
comprising, inserted into its genome, a nucleic acid according to the invention. 

Defective viruses, which entirely or almost entirely lack viral genes, are 
preferred. The term "defective virus" designates a virus incapable of replicating in 
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the target cell. Generally, the genome of the defective viruses used in the context of 
the present invention therefore lacks at least sequences necessary for the replication 
of the virus in the infected cell. These regions maybe either removed (completely 
or partly), or made nonfunctional, or substituted by other sequences and in 
particular by the nucleic acid of the invention. Preferably, the defective virus 
conserves, nevertheless, the sequences of its genome that are necessary for the 
encapsidation of the viral particles. Defective virus is not replication competent 
after introduction into a cell, and thus does not lead to a productive viral infection. 
Use of defective viral vectors allows for administration to cells in a specific, 
localized area, without concern that the vector can infect other cells. Thus, a 
specific tissue can be specifically targeted. Examples of particular vectors include, 
but are not limited to, a defective herpes virus 1 (HSVl) vector [Kaplitt et al., 
Molec, Cell. NeuroscL 2:320-330 (1991)], defective herpes virus vector lacking a 
glyco-protein L gene [Patent PubUcation RD 371005 A], or other defective herpes 
virus vectors [Intemational Patent Publication No. WO 94/21807, pubUshed 
September 29, 1994; Mtemational Patent Pubhcation No. WO 92/05263, published 
April 2, 1994]; an attenuated adenovirus vector, such as the vector described by 
Stratford-Perricaudet et al. [J. Clin, Invest 90:626-630 (1992); see also Le Gal La 
Salle et al.. Science 259:988-990 (1993)]; and a defective adeno-associated virus 
vector [Samulski et aL, J. Virol 61:3096-3101 (1987); Samulski et al., J. Virol 
63:3822-3828 (1989); Lebkowski et al., Mol Cell Biol 8:3988-3996 (1988)]. 

Preferably, for in vivo administration, an appropriate immimosuppressive 
treatment is employed in conjunction with the viral vector, e.g., adenovirus vector, 
to avoid immuno-deactivation of the viral vector and transfected cells. For 
example, irmnunosuppressive cytokines, such as interleukin-12 (IL-12), interferon- 
y (DFN-y), or anti-CD4 antibody, can be administered to block humoral or cellular 
immune responses to the viral vectors [see, e.g., Wilson, Nature Medicine (1995)]. 
In addition, it is advantageous to employ a viral vector that is engineered to 
express a minimal number of antigens. 

Naturally, the invention contemplates delivery of a vector that will express 
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a therapeutically effective amount of a polypeptide of the invention for gene 
therapy applications. The phrase "therapeutically effective amount" is used herein 
to mean an amount sufficient to reduce by at least about 15 percent, preferably by 
at least 50 percent, more preferably by at least 90 percent, and most preferably 
prevent, a clinically significant deficit in the activity, function and response of the 
host. Altematively, a therapeutically effective amount is sufficient to cause an 
improvement in a clinically significant condition in the host. 

Adenovirus vectors 
In a preferred embodiment, the vector is an adenovirus vector. 
Adenoviruses are eukaryotic DNA viruses that can be modified to efficiently 
deUver a nucleic acid of the invention to a variety of cell types. Various serotypes 
of adenovirus exist, whose structure and properties vary somewhat. Of these 
serotypes, preference is given, within the scope of the present invention, to using 
type 2 or type 5 human adenoviruses (Ad 2 or Ad 5) or adenoviruses of animal 
origin (see W094/26914). Those adenoviruses of animal origin that can be used 
within the scope of the present invention include adenoviruses of canine, bovine, 
murine (example: Mavl, Beard et al.. Virology 75 (1990) 81), ovine, porcine, 
avian, and simian (example: SAV) origin. Preferably, the adenovirus of animal 
origin is a canine adenovirus, more preferably a CAV2 adenovirus (e.g. Manhattan 
or A26/61 strain (ATCC VR-800), for example). Preferably, adenoviruses of 
human or canine or mixed origin are used in the context of the invention. 

Preferably, the replication defective adenoviral vectors of the invention 
comprise the ITRs, an encapsidation sequence and the nucleic acid of interest. Still 
more preferably, at least the El region of the adenoviral vector is non-functional. 
The deletion in the El region preferably extends from nucleotides 455 to 3329 in 
the sequence of the Ad5 adenovirus (PvuH-Bglll fi-agment) or 382 to 3446 (Hinfll- 
Sau3 A firagment). Other regions may also be modified, in particular the E3 region 
(WO95/02697), the E2 region (W094/28938), the E4 region (W094/28152, 
W094/12649 and WO95/02697), or in any of the late genes L1-L5. The viral gene 
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considered can be made nonfunctional by any technique known to persons skilled 
in the art, and in particular by total suppression, by substitution or partial deletion 
or by addition of one or more bases in the gene(s) considered. 

In a preferred embodiment, the adenoviral vector has a deletion in the El 
5 region (Ad 1.0). Examples of El-deleted adenoviruses are disclosed in EP 
185,573, the contents of which are incorporated herein by reference. In another 
preferred embodiment, the adenoviral vector has a deletion in the El and E4 
regions (Ad 3.0). Examples of El/E4-deleted adenoviruses are disclosed in 
WO95/02697 and W096/22378, the contents of which are incorporated herein by 

1 0 reference. In still another preferred embodiment, the adenoviral vector has a 

deletion in the El region into which the E4 region and the nucleic acid sequence of 
interest is inserted (see FR94 13355, the contents of which are incorporated herein 
by reference). According to another preferred embodiment, the exogenous nucleic 
acid sequence is inserted at the level of the deletion in the El region. 

1 5 The rephcation defective recombinant viruses of the invention can be 

prepared by any technique known to the person skilled in the art (Levrero et al.. 
Gene 101 (1991) 195, EP 185 573; Graham, EMBO J. 3 (1984) 2917). In 
particular, they can be prepared by homologous recombination between a defective 
adenovirus and a plasmid that carries, inter alia, the DNA sequence of interest. 

2 0 The homologous recombination occurs after co-transfection of the adenovirus and 
plasmid into an appropriate cell line. The cell line which is employed should 
preferably (i) be transformable by the said elements, and (ii) comprise the nucleic 
acid sequences that are able to complement the defective virus genome part, 
preferably in integrated form in order to avoid the risks of recombination. 

2 5 Examples of cell lines which may be used for the preparation of defective 

recombinant adenoviruses are the human embryonic kidney cell line 293 [Graham 
et al., J. Gen. Virol. 36 (1977) 59] which contains the left-hand portion of the 
genome of an Ad5 adenovirus (12%) integrated into its genome, and cell lines 
which are able to complement the El and E4 fimctions, as described in applications 

3 0 W094/26914 and WO95/02697. Recombinant adenoviruses are recovered and 
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purified using standard molecular biological techniques, which are well known to 
one of ordinary skill in the art. 

Adeno-associated virus vectors 

The adeno-associated viruses (AAV) are DNA viruses of relatively small 
size that can integrate, in a stable and site-specific manner, into the genome of the 
cells which they infect. They are able to infect a wide spectrum of cells without 
inducing any effects on cellular growth, morphology or differentiation, and they do 
not appear to be involved in human pathologies. The AAV genome has been 
cloned, sequenced and characterised. It encompasses approximately 4700 bases 
and contains an inverted terminal repeat (ITR) region of approximately 145 bases 
at each end, which serves as an origin of rephcation for the virus. The remainder 
of the genome is divided into two essential regions which carry the encapsulation 
functions: the left-hand part of the genome, which contains the rep gene involved 
in viral rephcation and expression of the viral genes; and the right-hand part of the 
genome, which contains the cap gene encoding the capsid proteins of the virus. 

The use of vectors derived from the AAVs for transferring genes in vitro 
and in vivo has been described (see WO 91/18088; WO 93/09239; US 4,797,368, 
US 5,139,941, EP 488 528). These publications describe various AAV-derived 
constructs in which the rep and/or cap genes are deleted and replaced by a gene of 
interest, and the use of these constructs for transferring the said gene of interest in 
vitro (into cultured cells) or in vivo, (directly into an organism). The rephcation 
defective recombinant AAVs according to the invention can be prepared by 
cotransfecting a plasmid containing the nucleic acid sequence of interest flanked by 
two AAV inverted terminal repeat (ITR) regions, and a plasmid carrying the AAV 
encapsulation genes (rep and cap genes), into a cell line which is infected with a 
human helper virus (for example an adenovirus). The AAV recombinants that are 
produced are then purified by standard techniques. 

The invention also relates, therefore, to an AAV-derived recombinant virus 
whose genome encompasses a sequence encoding a nucleic acid encoding a 
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polypeptide of the invention flanked by the AAV ITRs. The invention also relates 
to a plasmid encompassing a sequence encoding a nucleic acid encoding a 
polypeptide of the invention flanked by two ITRs from an AAV. Such a plasmid 
can be used as it is for transferring the nucleic acid sequence, with the plasmid, 
where appropriate, being incorporated into a liposomal vector (pseudo- virus). 

Retrovirus vectors 
hi another embodiment, the gene can be introduced in a retroviral vector, 
e.g., as described in Anderson et al., U.S. Patent No. 5,399,346; Mann et al., 1983, 
Cell 33:153; Temin et al, U.S. Patent No. 4,650,764; Temin et al., U.S. Patent No. 
4,980,289; Markowitz et al., 1988, J. Virol. 62:1 120; Temin et al., U.S. Patent No. 
5,124,263; EP 453242, EP178220; Bernstein et al. Genet. Eng. 7 (1985) 235; 
McCormick, BioTechnology 3 (1985) 689; Litemational Patent Pubhcation No. 
WO 95/07358, pubUshed March 16, 1995, by Dougherty et al,; and Kuo et al., 
1993, Blood 82:845. The retroviruses are integrating viruses that infect dividing 
cells. The retrovirus genome includes two LTRs, an encapsulation sequence and 
three coding regions (gag, pol and env). In recombinant retroviral vectors, the gag, 
pol and env genes are generally deleted, in whole or in part, and replaced with a 
heterologous nucleic acid sequence of interest. These vectors can be constructed 
from different types of retrovirus, such as, HIV, MoMuLV ("murine Moloney 
leukaemia virus" MSV ("murine Moloney sarcoma virus"), HaSV ("Harvey 
sarcoma virus"); SNV ("spleen necrosis virus"); RSV ("Rous sarcoma virus") and 
Friend virus. Defective retroviral vectors are disclosed in WO95/02697. 

In general, in order to construct recombinant retroviruses containing a 
nucleic acid sequence, a plasmid is constructed which contains the LTRs, the 
encapsulation sequence and the coding sequence. This construct is used to 
transfect a packaging cell line, which cell line is able to supply in trans the 
retroviral functions that are deficient in the plasmid. In general, the packaging cell 
lines are thus able to express the gag, pol and env genes. Such packaging cell lines 
have been described in the prior art, in particular the cell line PA317 
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(US4,861,719); the PsiCRIP cell line (WO90/02806) and the GP envAm'12 cell 
line (WO89/07 1 50). By way of example of a line which can be used for the 
preparation of defective recombinant retroviruses, there may be mentioned the 
CRIP line (Danos and Mulligan, PNAS 85 (1988) 6460). Next, the viruses that 
have multiplied are recovered and purified according to conventional molecular 
biology techniques, hi addition, the recombinant retroviral vectors can contain 
modifications within the LTRs for suppressing transcriptional activity as well as 
extensive encapsulation sequences which may include a part of the gag gene 
(Bender et al., J. Virol. 61 (1987) 1639). Recombinant retrovural vectors are 
purified by standard techniques known to those having ordinary skill in the art. 

Retroviral vectors can be constructed to fimction as infections particles or 
to undergo a single roimd of transfection. In the former case, the virus is modified 
to retain all of its genes except for those responsible for oncogenic transformation 
properties, and to express the heterologous gene. Non-infectious viral vectors are 
prepared to destroy the viral packaging signal, but retain the structural genes 
required to package the co-introduced virus engineered to contain the heterologous 
gene and the packaging signals. Thus, the viral particles that are produced are not 
capable of producing additional virus. 

Targeted gene delivery is described in International Patent Publication WO 
95/28494, pubhshed October 1995. 

Non- viral vectors 

Altematively, the vector can be introduced in vivo by lipofection. For the 
past decade, there has been increasing use of liposomes for encapsulation and 
transfection of nucleic acids in vitro. Synthetic cationic lipids designed to limit the 
difficulties and dangers encountered with liposome mediated transfection can be 
used to prepare liposomes for in vivo transfection of a gene encoding a marker 
[Feigner, et. al., Proc, Natl. Acad. ScL U.S.A. 84:7413-7417 (1987); see Mackey, et 
al., Proc. Natl Acad. Set. U,S.A. 85:8027-8031 (1988); Ubner et al.. Science 
259:1745-1748 (1993)]. The use of cationic lipids may promote encapsulation of 
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negatively charged nucleic acids, and also promote fusion with negatively charged 
cell membranes [Feigner and Ringold, Science 337:387-388 (1989)]. Particularly 
useful lipid compounds and compositions for transfer of nucleic acids are described 
in International Patent Publications W095/18863 and W096/17823, and in U.S. 
Patent No. 5,459,127. The use of lipofection to introduce exogenous genes into the 
specific organs in vivo has certain practical advantages. Molecular targeting of 
liposomes to specific cells represents one area of benefit. It is clear that directing 
transfection to particular cell types would be particularly advantageous in a tissue 
with cellular heterogeneity, such as pancreas, liver, kidney, and the brain. Lipids 
may be chemically coupled to other molecules for the purpose of targeting [see 
Mackey, et. al., supra]. Targeted peptides, e.g., hormones or neurotransmitters, 
and proteins such as antibodies, or non-peptide molecules could be coupled to 
liposomes chemically. 

Other molecules are also useful for facihtating transfection of a nucleic acid 
in vivo, such as a cationic oligopeptide {e.g., Intemational Patent Pubhcation 
W095/21931), peptides derived from DNA binding proteins {e.g., kttemational 
Patent Pubhcation WO96/25508), or a cationic polymer {e.g., Intemational Patent 
Publication W095/21931). 

It is also possible to introduce the vector in vivo as a naked DNA plasmid 
(see U.S. Patents 5,693,622, 5,589,466 and 5,580,859), Naked DNA vectors for 
gene therapy can be introduced into the desired host cells by methods known in the 
art, e.g., transfection, electroporation, microinjection, transduction, cell fusion, 
DEAE dextran, calcium phosphate precipitation, use of a gene gun, or use of a 
DNA vector transporter [see, e.g., Wu et al., J. Biol Chem. 267:963-967 (1992); 
Wu and Wu, J. Biol Chem. 263:14621-14624 (1988); Hartmut et ah, Canadian 
Patent Application No. 2,012,31 1, filed March 15, 1990; Wilhams et al., Proc. 
Natl Acad. Set 88:2726-2730 (1991)]. Receptor-mediated DNA delivery 
approaches can also be used [Curiel et ah. Hum. Gene Ther. 3:147-154 (1992); Wu 
and Wu, J. Biol Chem. 262:4429-4432 (1987)]. Preferred naked DNA vectors 
include pCOR plasmids having a conditional origin of rephcation (see 
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WO97/10343), and minicircle plasmids lacking an origin of replication and a 
marker gene (see WO96/26270). 

Recombinant Host Cells of the Invention 

The present invention also relates to the use of cells genetically modified 
ex vivo with a vims according to the invention, or of cells producing such viruses, 
implanted in the body, allov^ing a prolonged and effective expression in vivo of a 
biologically active polypeptide according to the invention. 

The present invention shows that it is possible to incorporate a nucleic 
acid encoding a polypeptide according to the invention into a viral vector, and that 
these vectors make it possible to effectively express a biologically active, mature 
polypeptide. More particularly, the invention shows that the in vivo expression of a 
polypeptide according to the invention may be obtained by direct administration of 
an adenovirus or by implantation of a producing cell or of a cell genetically 
modified by an adenovirus or by a retrovirus incorporating such a nucleic acid. 

hi this regard, another subject of the invention relates to any isolated 
mammalian cell infected with one or more defective recombinant viruses according 
to the invention. More particularly, the invention relates to any population of 
human cells infected with these viruses. These may be in particular cells of blood 
origin (totipotent stem cells or precursors), fibroblasts, myoblasts, hepatocytes, 
keratinocytes, smooth muscle and endothelial cells, glial cells and the like. 

Another subject of the invention relates to an implant comprising isolated 
mammalian cells infected with one or more defective recombinant viruses 
according to the invention or cells producing recombinant viruses, and an 
extracellular matrix. Preferably, the implants according to the invention comprise 
10^ to lOlO cells. More preferably, they comprise 10^ to 10^ cells. 

More particularly, in the implants of the invention, the extracellular matrix 
comprises a gelling compound and optionally, a support allowing the anchorage of 
the cells. 

The invention also relates to an isolated recombinant host cell comprising 
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a nucleic acid of the invention, and more particularly, a nucleic acid comprising 
either one of SEQ JD NOs: 1 or 9, or of a complementary polynucleotide sequence. 

The invention also relates to an isolated recombinant host cell comprising a 
nucleic acid of the invention, and more particularly a nucleic acid comprising a 
5 nucleotide sequence as depicted in either SEQ ID NOs: 1 or 9, or of a 
complementary polynucleotide sequence. 

The invention also relates to an isolated recombinant host cell comprising a 
nucleic acid encoding a polypeptide comprising an amino acid sequence of either 
one of SEQ ID NOs: 8 or 10. 
1 0 According to another aspect, the invention also relates to an isolated 

recombinant host cell comprising a recombinant vector according to the invention. 
Therefore, the invention also relates to a recombinant host cell comprising a 
recombinant vector comprising any of the nucleic acids of the invention. 

Specifically, the invention relates to an isolated recombinant host cell 
1 5 comprising a recombinant vector comprising a nucleic acid comprising either one 
of SEQ ID NOs: 1 or 9, or of a complementary polynucleotide sequence. 

The invention also relates to an isolated recombinant host cell comprising a 
recombinant vector comprising a nucleic acid comprising a polynucleotide 
sequence as depicted in either one of SEQ ID NOs: 1 or 9, or of a complementary 
2 0 polynucleotide sequence. 

The invention also relates to an isolated recombinant host cell comprising a 
recombinant vector comprising a nucleic acid encoding a polypeptide comprising 
an amino acid sequence of either one of SEQ ID NOs: 8 or 10. 

The preferred host cells according to the invention are for example the 

2 5 following: 

a) prokaryotic host cells: strains of Escherichia coli (strain DHS-a), of 
Bacillus subtilis, of Salmonella typhimurium, or strains of genera such as 
Pseudomonas, Streptomyces and Staphylococus ; 

b) eukaryotic host cells: HeLa cells (ATCC No. CCL2), Cv 1 cells (ATCC 

3 0 No. CCL70), COS cells (ATCC No. CRL 1650), Sf-9 cells (ATCC No. CRL 
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1711), CHO cells (ATCC No. CCL-61) or 3T3 cells (ATCC No. CRL-6361). 

Pharmaceutical Compositions and Delivery 
The present invention also relates to pharmaceutical compositions. Such 
compositions may comprise a polypeptide, nucleic acid, recombinant vector, 
recombinant virus, recombinant host cell, or implant of the invention, as defined 
above, and a pharmaceutically acceptable excipient, carrier or vehicle. The 
compositions of the invention are particularly suitable for formulation of biological 
material for gene therapy. Thus, in a preferred embodiment, the composition 
comprises a nucleic acid encoding a human ngnS protein or polypeptide 
comprising amino acid sequence SEQ ID NO: 10. 

Any vector, viral or non- viral, of the invention will preferably be introduced 
in vivo in a pharmaceutically acceptable excipient, vehicle or carrier. The phrase 
"pharmaceutically acceptable" refers to molecular entities and compositions that 
are physiologically tolerable and do not typically produce an allergic or similar 
untoward reaction, such as gastric upset, dizziness and the like, when administered 
to a hximan. Preferably, as used herein, the term "pharmaceutically acceptable" 
means approved by a regulatory agency of the Federal or a state government or 
listed in the U.S. Pharmacopeia or other generally recognized pharmacopeia for use 
in animals, and more particularly in humans. The term "carrier" refers to a diluent, 
adjuvant, excipient, or vehicle with which the compound is administered. Such 
pharmaceutical carriers can be sterile hquids, such as water and oils, including 
those of petroleum, animal, vegetable or synthetic origin, such as peanut oil, 
soybean oil, mineral oil, sesame oil and the like. Water or aqueous solution saline 
solutions and aqueous dextrose and glycerol solutions are preferably employed as 
carriers, particularly for injectable solutions. Suitable pharmaceutical carriers are 
described in "Remington's Pharmaceutical Sciences" by E.W. Martin. 

The pharmaceutical compositions of the invention may be formulated for 
the purpose of topical, oral, parenteral, intranasal, intravenous, intramuscular, 
subcutaneous, intraocular, and the like, administration. 
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Preferably, the pharmaceutical compositions contain pharmaceutically 
acceptable vehicles for an injectable formulation. These can be, in particular, 
sterile, isotonic saline solutions (monosodium or disodium phosphate, sodium, 
potassium, calcium or magnesium chloride, and the like, or mixtures of such salts), 
5 or dry, in particular lyophilized, compositions which, on addition, as appropriate, 
of sterilized water or of physiological saline, enable injectable solutions to be 
formed. 

The compositions may be in particular isotonic, sterile, saline solutions 
(monosodiimi or disodium phosphate, sodium, potassium, calcium or magnesium 

1 0 chloride and the like or mixtures of such salts), or dry, especially freeze-dried 
compositions that upon addition, depending on the case, of sterihzed water or 
physiological saline, allow the constitution of injectable solutions. 

The preferred sterile injectable preparations can be a solution or suspension 
in a nontoxic parenterally acceptable solvent or diluent. Examples of 

1 5 pharmaceutically acceptable carriers or vehicles are saline, buffered saline, isotonic 
sahne (e.g., monosodium or disodium phosphate, sodium, potassium, calcium or 
magnesium chloride, or mixtures of such salts), Ringer's solution, dextrose, water, 
sterile water, glycerol, ethanol, and combinations thereof 1,3-butanediol and 
sterile fixed oils are conveniently employed as solvents or suspending media. Any 

2 0 bland fixed oil can be employed including synthetic mono- or di-glycerides. Fatty 
acids such as oleic acid also find use in the preparation of injectables. 

The doses of nucleic acids of the invention, either alone or incorporated in a 
vector, used for administration can be adjusted in accordance with different 
parameters, and in particular in accordance with the mode of administration used, 

2 5 the pathology in question, the gene to be expressed or the desired treatment period. 

Generally speaking, in the case of the recombinant viruses according to the 
invention, these are formulated and administered in the form of doses of between 
10"^ and 10^"^ pfu, and preferably 10^ to 10^^ pfii. The term pfii (plaque forming 
unit) corresponds to the infectious power of a solution of virus, and is determined 

3 0 by infection of a suitable cell culture and measurement, generally after 48 hours, of 
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the number of infected cell plaques. The techniques of determination of the pfu 
titre of a viral solution are well documented in the literature. 

The phrase "therapeutically effective amount" is used herein to mean an 
amount sufficient to reduce by at least about 15 percent, preferably by at least 50 
percent, more preferably by at least 90 percent, and most preferably prevent, a 
clinically significant deficit in the activity, fimction and response of the host. 
Alternatively, a therapeutically effective amoimt is sufficient to cause an 
improvement in a clinically significant condition in the host. 

The composition of the invention may be introduced parenterally or 
transmucosally, e.g., orally, nasally, vaginally, or rectally, or transdermally. 
Preferably, administration is parenteral, e.g., via intravenous injection, and also 
including, but is not limited to, intra-arteriole, intramuscular, intradermal, 
subcutaneous, intraperitoneal, intraventricular, and intracranial administration. The 
administration of the composition may be introduced by injection directly into the 
site to be treated, in particular, into the nervous system. 

The preferred route of administration to the nervous system is by direct 
injection. The nervous system can be imaged using any of the techniques available 
in the art, such as magnetic resonance imaging or computer-assisted tomography, 
and the therapeutic composition administered by stereotactic injection, for 
example. 

hi yet another embodiment, a composition comprising a human ngn3 
polypeptide, or nucleic acid encoding the polypeptide, can be delivered in a 
controlled release system. For example, the nucleic acid or polypeptide may be 
administered using intravenous infusion, an implantable osmotic pump, a 
transdermal patch, liposomes, or other modes of administration. In one 
embodiment, a pxmip may be used [see Langer, supra; Sefton, CRC CriL Ref. 
Biomed Eng, 14:201 (1987); Buchwald et aL, Surgery 88:507 (1980); Saudek et 
al., N. Engl. J. Med. 321:574 (1989)]. hi another embodiment, polymeric materials 
can be used [see Medical Applications of Controlled Release, Langer and Wise 
(eds.), CRC Press: Boca Raton, Florida (1974); Controlled Drug Bioavailability, 
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Drug Product Design and Performance, Smolen and Ball (eds.), Wiley: New York 
(1984); Ranger and Peppas, J. Macromol ScL Rev. Macromol Chem. 23:61 
(1983); see also Levy et al. Science 228:190 (1985); During et al., Ann. Neurol 
25:351 (1989); Howard et al., J.Neurosurg. 71:105 (1989)]. In yet another 
embodiment, a controlled release system can be placed in proximity of the 
therapeutic target, i.e., the heart, thus requiring only a fraction of the systemic dose 
{see, e.g., Goodson, in Medical Applications of Controlled Release, supra, vol. 2, 
pp. 1 15-138 (1984)]. Other controlled release systems are discussed in the review 
by Langer {Science 249:1527-1533 (1990)]. 

Thus, the compositions of the invention can be delivered by intravenous, 
intraarterial, intraperitoneal, intramuscular, or subcutaneous routes of 
administration. Alternatively, the compositions, properly formulated, can be 
administered by nasal or oral administration. A constant supply of the biological 
material can be ensured by providing a therapeutically effective dose {i.e., a dose 
effective to induce metabolic changes in a subject) at the necessary intervals, e.g., 
daily, every 12 hours, etc. These parameters will depend on the severity of the 
disease condition being treated, other actions, such as diet modification, that are 
implemented, the weight, age, and sex of the subject, and other criteria, which can 
be readily determined according to standard good medical practice by those of skill 
in the art. 

An organism in whom administration of a biological material within the 
scope of the invention is administered is preferably a human, but can be any 
animal. Thus, as can be readily appreciated by one of ordinary skill in the art, the 
methods and pharmaceutical compositions of the present invention are particularly 
suited to administration to any animal, particularly a mammal, and including, but 
by no means limited to, domestic animals, such as feline or canine subjects, farm 
animals, such as but not limited to bovine, equine, caprine, ovine, and porcine 
subjects, wild animals (whether in the wild or in a zoological garden), research 
animals, such as mice, rats, rabbits, goats, sheep, pigs, dogs, cats, etc., avian 
species, such as chickens, turkeys, songbirds, etc., i.e., for veterinary medical use. 
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The invention relates to the use of a polypeptide, nucleic acid, recombinant 
vector, recombinant virus, recombinant host cell, or implant according to the 
invention for the manufacture or preparation of a medicament or pharmaceutical 
composition intended for the prevention or treatment of a patient or subject 
suffering from a pathology, dysfunction, disorder, or condition affecting the 
nervous system. 

The invention relates to the use of a polypeptide according to the invention, 
preferably a polypeptide having an amino acid sequence of either one of SEQ ID 
NOs: 8 or 10 for the manufacture or preparation of a medicament or 
pharmaceutical composition intended for the prevention or treatment of a patient or 
subject suffering from a pathology, dysfunction, disorder, or condition affecting the 
nervous system. 

The invention also relates to the use of a nucleic acid according to the 
invention, preferably a nucleic acid encoding a polypeptide having an amino acid 
sequence of either one of SEQ ID NOs: 8 or 10 for the manufacture or preparation 
of a medicament or pharmaceutical composition intended for the prevention or 
treatment of a patient or subject suffering from a pathology, dysfimction, disorder, 
or condition affecting the nervous system. 

The invention also relates to the use of a recombinant vector according to 
the invention, preferably a recombinant vector comprising a nucleic acid encoding 
a polypeptide having an amino acid sequence of either one of SEQ ID NOs: 8 or 
1 0, for the manufacture or preparation of a medicament or pharmaceutical 
composition intended for the prevention or treatment of a patient or subject 
suffering from a pathology, dysfunction, disorder, or condition affecting the 
nervous system. 

The invention also relates to the use of a defective recombinant virus 
according to the invention, preferably a defective recombinant virus comprising a 
nucleic acid encoding a polypeptide having an amino acid sequence of either one of 
SEQ ID NOs: 8 or 10, for the manufacture or preparation of a medicament or 
pharmaceutical composition intended for the prevention or treatment of a patient or 
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subject suffering from a pathology, dysfunction, disorder, or condition affecting the 
nervous system. Thus, the present invention also relates to a pharmaceutical 
composition comprising one or more defective recombinant viruses according to 
the invention. 

The invention also relates to the use of a recombinant host cell according to 
the invention, preferably a recombinant host cell comprising a nucleic acid 
encoding a polypeptide having an amino acid sequence of either one of SEQ ID 
NOs: 8 or 10, for the manufacture or preparation of a medicament or 
pharmaceutical composition intended for the prevention or treatment of a patient or 
subject suffering from a pathology, dysftmction, disorder, or condition affecting the 
nervous system. 

The invention also relates to the use of an implant according to the 
invention, preferably an implant comprising a recombinant host cell comprising a 
nucleic acid encoding a polypeptide having an amino acid sequence of either one of 
SEQ ID NOs: 8 or 10, for the manufacture or preparation of a medicament or 
pharmaceutical composition intended for the prevention or treatment of a patient or 
subject suffering from a pathology, dysftmction, disorder, or condition affecting the 
nervous system. 

The present invention also relates to the use of cells genetically modified 
ex vivo with a virus according to the invention, or of producing cells such as 
viruses, implanted in the body, allowing a prolonged and effective expression 
in vivo of a biologically active polypeptide according to the invention, preferably a 
polypeptide having an amino acid sequence of either one of SEQ ID NOs: 8 or 10. 

The invention also relates to the use of a polypeptide according to the 
invention, preferably a polypeptide having an amino acid sequence of either one of 
SEQ ID NOs: 8 or 10, to control, influence, and/or participate in the expression of 
a gene. 

The invention also relates to the use of a nucleic acid according to the 
invention, preferably a nucleic acid comprising a polynucleotide sequence of either 
one of SEQ ID NOs: 1 or 9, for the manufacture or preparation of a medicament or 
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pharmaceutical composition intended for the prevention or treatment of a patient or 
subject suffering from a pathology, dysfunction, disorder, or condition affecting the 
nervous system. 

The invention also relates to the use of a recombinant vector according to 
the invention, preferably comprising a nucleic acid comprising a polynucleotide 
sequence of either one of SEQ ID NOs: 1 or 9, for the manufacture or preparation 
of a medicament or pharmaceutical composition intended for the prevention or 
treatment of a patient or subject suffering from a pathology, dysfimction, disorder, 
or condition affecting the nervous system. 

Nucleotide Probes and Primers of the Livention 
Nucleotide probes and primers hybridizing with a nucleic acid (genomic 
DNA, messenger RNA, cDNA) according to the invention also form part of the 
invention. 

The definition of a nucleotide probe or primer according to the invention 
therefore covers oHgonucleotides that hybridize, under the high stringency 
hybridization conditions defined above, with a polynucleotide sequence of a 
nucleic acid according to the invention, or a complementary polynucleotide 
sequence. 

According to the invention, nucleic acid fragments derived from a 
polynucleotide according to the invention are useful for the detection of the 
presence of at least one copy of a bHLH nucleic acid according to the invention, or 
of a fragment or of a variant (containing a mutation or a polymorphism) thereof in 
a sample. 

According to the invention, nucleic acid fragments derived from a nucleic 
acid comprising a polynucleotide sequence of either one of SEQ ID NOs: 1 or 9, or 
of a complementary polynucleotide sequence, are useful for the detection of the 
presence of at least one copy of a nucleotide sequence of a bHLH nucleic acid 
according to the invention, or of a fragment or of a variant (containing a mutation 
or a polymorphism) thereof in a sample. Nucleotide probes and primers 
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hybridizing with a nucleic acid sequence of a nucleic acid (genomic DNA, 
messenger RNA, cDNA) that encodes a polypeptide comprising an amino acid 
sequence comprising SEQ ID NO: 8 or SEQ ID NO: 10, also form part of the 
invention. 

The nucleotide probes or primers according to the invention comprise at 
least 8 consecutive nucleotides of a nucleic acid comprising a polynucleotide 
sequence of a) SEQ ED NO: 1, or of a complementary polynucleotide sequence, 
or b) SEQ ID NO: 9, or of a complementary polynucleotide sequence. 

Preferably, nucleotide probes or primers according to the invention will 
have a length of at least 10, 12, 15, 18, 20 to 25, 35, 40, 50, 70, 80, 100, 200, 
500, 1000, or 1500 consecutive nucleotides of a nucleic acid according to the 
invention, in particular of a nucleic acid comprising a polynucleotide sequence 
of a) SEQ ID NO: 1, or of a complementary polynucleotide sequence, orb) SEQ 
ID NO: 9, or of a complementary polynucleotide sequence. 

Alternatively, a nucleotide probe or primer according to the invention will 
consist of and/or comprise a fragment having a length of 8, 10, 12, 15, 18, 20, 25, 
35, 40, 50, 100, 200, 500, 1000, or 1500 consecutive nucleotides of a nucleic acid 
according to the invention, more particularly of a nucleic acid comprising a 
polynucleotide sequence of a) SEQ ID NO: 1, or of a complementary 
polynucleotide sequence, or b) SEQ ID NO: 9, or of a complementary 
polynucleotide sequence. 

The preferred probes and primers according to the invention comprise all 
or part of a polynucleotide sequence comprising any one of SEQ ID NOs: 1, 2, 
3,4,5,9, 11, 12, 14, 15, 16, 17, 18, 19,21,23,24, or 25, or of a complementary 
polynucleotide sequence. 

The nucleotide primers according to the invention may be used to amplify 
any one of the nucleic acids according to the invention, and more particularly a 
nucleic acid comprising a polynucleotide sequence of either one of SEQ ID NOs: 1 
or 9, or of a complementary polynucleotide sequence. 
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Alternatively, the nucleotide primers according to the invention may be 
used to amplify a nucleic acid fragment or variant of a nucleic acid comprising a 
polynucleotide sequence of either one of SEQ ID NOs: 1 or 9, or of a 
complementary polynucleotide sequence. 
5 A nucleotide primer or probe according to the invention may be prepared 

by any suitable method well known to persons skilled in the art, including by 
cloning and action of restriction enzymes or by direct chemical synthesis 
according to techniques such as the phosphodiester method by Narang et al 
(Methods Enzymol, 68:90-98, 1979) or by Brown et al (Methods Enzymol, 
10 68:109-151,1 979), the diethylphosphoramidite method by Beaucage et al 
(Tetrahedron Lett 22: 1859-1862, 1981) or the technique on a sohd support 
described in EU patent No.EP0,707,592. 

Each of the nucleic acids according to the invention, including the 
oligonucleotide probes and primers described above, maybe labeled, if desired, 
15 by incorporating a marker which can be detected by spectroscopic, 

photochemical, biochemical, immunochemical or chemical means. For 

32 33 3 35 

example, such markers may consist of radioactive isotopes ( P, P, H, S), 
fluorescent molecules (5-bromodeoxyuridine, fluorescein, acetylaminofluorene, 
digoxigenin) or Ugands such as biotin. The labeling of the probes is preferably 

2 0 carried out by incorporating labeled molecules into the polynucleotides by 

primer extension, or alternatively by addition to the 5' or 3 'ends. Examples of 
nonradioactive labeling of nucleic acid fragments are described in particular in 
French patent No.78 109 75 or in the articles by Urdea et al. (Nucleic Acids 
Research, 11:4937-4957, 1988) or Sanchez-Pescador et al. (J. Clin. Microbiol., 
25 26(10):1934-1938, 1988). 

Preferably, the nucleotide probes and primers according to the invention 
may have structural characteristics of the type to allow amplification of the signal, 
such as the probes described by Urdea et al. (Nucleic Acids Symp Ser., 24:197- 
200, 1991) or altematively in European patent No. EPO 225,807 (CHIRON). 

3 0 The oligonucleotide probes according to the invention may be used in 



ST96042A US 



68 



particular in Southern-type hybridizations with the genomic DNA or alternatively 
in hybridizations with the corresponding messenger RNA when the expression of 
the corresponding transcript is sought in a sample. 

The probes and primers according to the invention may also be used for the 
detection of products of PGR amphfication or altematively for the detection of 
mismatches. 

Nucleotide probes or primers according to the invention may be 
immobilized on a soUd support. Such soUd supports are well known to persons 
skilled in the art and comprise surfaces of wells of microtiter plates, polystyrene 
beds, magnetic beds, nitrocellulose bands or microparticles such as latex particles. 

Methods for AmpUfVine and Detecting Nucleic Acids Enc oding Polypeptides of 

the Invention 

The invention also relates to means for the amphfication and detection of 
nucleic acids according to the invention, preferably a nucleic acid encodmg a 
polypeptide comprising an amino acid sequence of either one of SEQ ID NO: 8 
or SEQ ID NO: 10, 

A preferred embodiment of the present invention relates to a method of 
ampUfying a nucleic acid according to the invention, and more particularly a 
nucleic acid comprising a polynucleotide sequence of SEQ ID NOs: 1 or 9, or of a 
complementary polynucleotide sequence; or a nucleic acid fragment or variant 
thereof contained in a sample, wherein said method comprises the steps of: 

a) bringing the sample in which the presence of the target nucleic acid is 
suspected into contact with a pair of nucleotide primers whose hybridization 
position is located respectively on the 5 'side and on the 3 'side of the region of the 
target nucleic acid whose amphfication is sought, in the presence of the reagents 
necessary for the amplification reaction; and 

b) detecting the amphfied nucleic acids. 

The present invention also relates to a method of detecting the presence of a 
nucleic acid in a sample, wherein the nucleic acid comprises a polynucleotide 
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sequence of SEQ ID NOs: 1 or 9, or of a complementary polynucleotide sequence; 
or a nucleic acid fragment or variant thereof, said method comprising the steps of: 

1) bringing one or more nucleotide probes according to the invention into 
contact with the sample to be tested; 

2) detecting the complex which may have formed between the probe(s) and 
the nucleic acid present in the sample. 

According to a specific embodiment of the method of detection according 
to the invention, the oligonucleotide probes and primers are immobihzed on a 
support. 

According to another aspect, the oUgonucleotide probes and primers 
comprise a detectable marker. 

The invention relates, in addition, to a box or kit for detecting the presence 
of a nucleic acid according to the invention in a sample, said box or kit comprising: 

a) one or more nucleotide probe(s) or primer(s) as described above; 

b) where appropriate, the reagents necessary for the hybridization reaction. 
According to a first aspect, the detection box or kit is characterized in that 

the probe(s) or primer(s) are immobilized on a support. 

According to a second aspect, the detection box or kit is characterized in 
that the oligonucleotide probes comprise a detectable marker. 

According to a specific embodiment of the detection kit described above, 
such a kit will comprise a plurality of oligonucleotide probes and/or primers in 
accordance with the invention which may be used to detect a target nucleic acid of 
interest or alternatively to detect mutations in the coding regions or the non-coding 
regions of the nucleic acids according to the invention. 

Another subject of the invention is a box or kit for ampHfying all or part of 
a nucleic acid comprising a polynucleotide sequence of SEQ ID NOs: 1 or 9, or of 
a complementary polynucleotide sequence, wherein said box or kit comprises: 

1) a pair of nucleotide primers in accordance with the invention, whose 
hybridization position is located respectively on the 5' side and 3' side of the 
target nucleic acid whose amphfication is sought; and optionally. 
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2) reagents necessary for an amplification reaction. 

Such an amplification box or kit will preferably comprise at least one pair 
of nucleotide primers as described above. 

The invention also relates to a box or kit for detecting the presence of a 
nucleic acid according to the invention in a sample, said box or kit comprising: 

a) one or more nucleotide probes according to the invention; 

b) where appropriate, reagents necessary for a hybridization reaction. 
According to a first aspect, the detection box or kit is characterized in that 

the nucleotide probe(s) and primer(s)are immobihzed on a support. 

According to a second aspect, the detection box or kit is characterized in 
that the nucleotide probe(s) and primer(s) comprise a detectable marker. 

According to a specific embodiment of the detection kit described above, 
such a kit will comprise a plurality of oligonucleotide probes and/or primers in 
accordance with the invention that may be used to detect target nucleic acids of 
interest. 

According to preferred embodiment of the invention, the target nucleic acid 
comprises a polynucleotide sequence of SEQ ID NOs: 1 or 9, or of a 
complementary polynucleotide sequence. 

Altematively, the target nucleic acid is a nucleic acid firagment or variant of 
a nucleic acid comprising a polynucleotide sequence of SEQ ID NOs: 1 or 9, or of 
a complementary polynucleotide sequence. 

According to a preferred embodiment, two primers according to the 
invention comprise all or part of SEQ ID NOs: 4 and 13, making it possible to 
amphfy the region of nucleotides 750-1460 of SEQ ID NO: 1 , or a nucleic acid 
having a complementary polynucleotide sequence. 

According to a preferred embodiment, two primers according to the 
invention comprise all or part of SEQ ID NOs: 5 and 13, making it possible to 
amphfy the region of nucleotides 777-1460 of SEQ ID NO: 1, or a nucleic acid 
having a complementary polynucleotide sequence. 

According to a preferred embodiment, two primers according to the 
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invention comprise all or part of SEQ ID NOs: 5 andl3, making it possible to 
amplify the region of nucleotides 640-1330 of SEQ ID NO: 9, or a nucleic acid 
having a complementary polynucleotide sequence. 

According to a preferred embodiment, two primers according to the 
invention comprise all or part of SEQ ID NOs: 5 and 25, making it possible to 
amplify the region of nucleotides 640-1330 of SEQ ID NO: 9, or a nucleic acid 
having a complementary polynucleotide sequence. 

According to a preferred embodiment, two primers according to the 
invention comprise all or part of SEQ ID NOs: 24 and 25, making it possible to 
amplify the region of nucleotides 1-1330 of SEQ ID NO: 9, or a nucleic acid 
having a complementary polynucleotide sequence. 

According to another preferred embodiment, a primer according to the 
invention comprises, generally, all or part of any one of SEQ ID NOs: 1, 2, 3, 4, 5, 
9, 11, 12, 14, 15, 16, 17, 18, 19, 21, 23, 24, or 25, or a complementary sequence. 

Thus, the probes according to the invention, immobihzed on a support, may 
be ordered into matrices such as "DNA chips". Such ordered matrices have in 
particular been described in US patent No. 5,143,854, in pubUshed PCX 
applications WO 90/15070 and WO 92/10092. 

Support matrices on which oligonucleotide probes have been immobilized 
at a high density are for example described in US patent No. 5,412,087 and in 
pubhshed PCT application WO 95/1 1995. 

The nucleotide primers according to the invention may be used to amplify 
any one of the nucleic acids according to the invention, or a complementary 
polynucleotide sequence. Alternatively, the nucleotide primers according to the 
invention may be used to amplify a nucleic acid fragment or variant of a nucleic 
acid according to the invention, or a complementary polynucleotide sequence. 

Another subject of the invention relates to a method of ampUfying a nucleic 
acid according to the invention, or a complementary polynucleotide sequence, 
contained in a sample, said method comprising the steps of: 

a) bringing the sample in which the presence of the target nucleic acid is 
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suspected into contact with a pair of nucleotide primers whose hybridization 
position is located respectively on the 5' side and on the 3' side of the region of the 
target nucleic acid whose amplification is sought, in the presence of the reagents 
necessary for the amplification reaction; and 
5 b) detecting the amplified nucleic acids. 

To carry out the amplification method as defined above, use will be 
preferably made of any of the nucleotide primers described above. 

The subject of the invention is, in addition, a box or kit for amplifying all or 
part of a nucleic acid according to the invention, or a complementary 
1 0 polynucleotide sequence, said box or kit comprising: 

a) a pair of nucleotide primers in accordance with the invention, whose 
hybridization position is located respectively on the 5' side and 3' side of the 
target nucleic acid whose amplification is sought; and optionally, 

b) reagents necessary for the amplification reaction. 

1 5 Such an amplification box or kit will preferably comprise at least one pair 

of nucleotide primers as described above. 

The invention also relates to a box or kit for detecting the presence of a 
nucleic acid according to the invention in a sample, said box or kit comprising: 
a) one or more nucleotide probes according to the invention; 
2 0 b) where appropriate, reagents necessary for a hybridization reaction. 

According to a first aspect, the detection box or kit is characterized in that 
the nucleotide probe(s) and primer(s)are immobilized on a support. 

According to a second aspect, the detection box or kit is characterized in 
that the nucleotide probe(s) and primer(s) comprise a detectable marker. 

2 5 According to a specific embodiment of the detection kit described above, 

such a kit will comprise a plurality of oligonucleotide probes and/or primers in 
accordance with the invention which may be used to detect target nucleic acids of 
interest or alternatively to detect mutations in the coding regions or the non-coding 
regions of the nucleic acids according to the invention. 

3 0 According to preferred embodiment of the invention, the target nucleic acid 
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comprises a polynucleotide sequence of SEQ ID NOs: 1 or 9, or of a 
complementary polynucleotide sequence. 

Alternatively, the target nucleic acid is a nucleic acid fragment or variant of 
a nucleic acid comprising a polynucleotide sequence of SEQ ID NOs: 1 or 9, or of 
5 a complementary polynucleotide sequence. 

According to a preferred embodiment, a primer according to the invention 
comprises, generally, all or part of any one of SEQ ED NOs: 1, 2, 3, 4, 5, 9, 11, 12, 
14, 15, 16, 17, 18, 19, 21, 23, 24, or 25, or a complementary sequence. 

10 Screening Assays 

Identification and isolation of a gene encoding a polypeptide of the 
invention provides for expression of the polypeptide in quantities greater than can 
be isolated from natural sources, or in indicator cells that are specially engineered 
to indicate the activity of a polypeptide of the invention expressed after transfection 

15 or transformation of the cells. Accordingly, in addition to rational design of 

agonists and antagonists based on the structure of a polypeptide of the invention, 
the present invention contemplates an alternative method for identifying specific 
Ugands of the polypeptide using various screening assays known in the art. 

Basic helix-loop-helix proteins play a role in cell differentiation and/or cell 

2 0 fate determination by regulating or inducing transcription of key genes. Therefore, 
agonists and antagonists of a polypeptide of the present invention will be expected 
to improve or decrease its transcription factor activity. 

The invention provides methods for screening small molecules and 
compounds that act on a basic helix-loop-helix protein to identify agonists and 

2 5 antagonists of Relax or hngn3 polypeptides that can improve, reduce, or inhibit 

neurogenesis, or a function of the nervous system, or a fiznction of the endocrine 
system, in particular, the pancreas from a therapeutic point of view. For example, 
hngn3 may be implicated in the transforming process that leads to some 
neuroectodermal and/or pancreatic tumors. In these cases, inhibiton of bHLH 

3 0 activity could be of great therapeutic value. Furthermore, activation of hngn3 may 
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be useful to generate in vitro or in vivo specific neuronal cells and endocrine 
pancreatic cells. It has recently been reported that the overexpression of ngn3 in 

32 

the developing pancreas yields an increase number of insulin producing cells. 
Thus, the activation of hngn3 polypeptides may be useful in the treatment of type 1 
5 diabetes mellitus. In general, these methods are useful to identify small molecules 
and compounds for therapeutic use in the treatment of a disease^ condition^ 
disorder, pathology, or dysfunction of the nervous system, or the endocrine system, 
specifically, the pancreatic organ. 

Thus, the present invention relates to a method of screening for antagonists 
10 or agonists of the polypeptides according to the invention. Preferably, the method 
of screening for antagonists or agonists of the polypeptides according to the 
invention is directed toward identifying an antagonist or agonist of a polypeptide 
comprising an amino acid sequence of either one of SEQ ID NO: 8 or SEQ ID NO: 
10. 

1 5 Therefore, the invention also relates to the use of a polypeptide according 

to the invention or a cell expressing a polypeptide according to the invention, for 
screening active ingredients for the prevention or treatment of a disease, 
condition, disorder, pathology, or dysfunction of the nervous system. An active 
site and oligopeptide or immunogenic fragments of a polypeptide according to 

2 0 the invention can serve for screening product libraries by a whole range of 

existing techniques. The polypeptide fragment used in this type of screening may 
be free in solution, bound to a solid support, at the cell surface or in the cell. The 
formation of the binding complexes between the polypeptide fragments and the 
tested agent can then be measured. 

2 5 Another product screening technique that may be used in high-flux 

screenings giving access to products having affinity for the protein of interest is 
described in application WO84/03564. In this method, applied to a polypeptide 
according to the invention, various products are synthesized on a solid surface. 
These products react with the polypeptide or fragment thereof and the complex is 

3 0 washed. The products binding the polypeptide are then detected by methods known 
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to persons skilled in the art. Non-neutralizing antibodies can also be used to 
capture a peptide and immobilize it on a support. 

Another possibility is to perform a product screening method using a 
neutralizing antibody competition, a polypeptide according to the invention, and a 
5 product potentially binding the polypeptide, hi this manner, the antibodies may be 
used to detect the presence of a peptide having a common antigenic unit with the 
polypeptide or protein according to the invention. 

Accordingly, this invention relates to the use of any method of screening 
;3 products, i.e,, compounds, small molecules, and the like, this being in all synthetic 

1 0 or cellular types, that is to say of mammals, insects, bacteria, or yeasts expressing 
constitutively or having incorporated a rat Relax or himian ngnS encoding nucleic 
¥: acid. 

The present invention also relates to the use of such a system for screening 
5 molecules that modulate the activity of a polypeptide according to the invention, 

f 15 Thus, the invention relates to methods of screening and identifying a modulator, 

□ agonist, or antagonist of a polypeptide according to the iirs^ention in a sample. 

™ Any screening technique known in the art can be used to screen for agonists 

or antagonists of the polypeptides of the invention. For example, a suitable isolated 
cell or cell line that expresses a polypeptide according to the invention, preferably 
2 0 human ngw J, can be transfected with a nucleic acid encoding a marker gene, such 
as p-galactosidase (p-gal). The isolated cell or cell line may express the 
polypeptide of the invention either naturally or after transfection of a nucleic acid 
encoding the polypeptide. The cell or cell hne is then exposed to a test solution or 
sample comprising a potential agonist or antagonist, and then stained for (3- 

2 5 galactosidase activity. The presence of more p-gal positive cells relative to control 

cells not exposed to the test solution or sample is an indication of the presence of 
an agonist in the test solution or sample. Conversely, the presence of less p-gal 
positive cells relative to control cells not exposed to the test solution or sample is 
an indication of the presence of an antagonist in the test solution or sample, 

3 0 The present invention relates to methods of identifying a modulator, 
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agonist, or antagonist of a polypeptide according to the invention in a sample 
comprising 

a) obtaining a cell, for example a cell line, that, either naturally or after 
transfecting the cell with a nucleic acid encoding a polypeptide according to the 

5 invention, expresses a polypeptide according to the invention, 

b) transfecting the cell with a nucleic acid encoding a marker gene, such as 
p-galactosidase (p-gal), 

c) incubating the cell of step b) with a test solution or sample comprising a 
potential modulator, agonist or antagonist, 

10 d) measuring the amount of p-galactosidase activity, and 

e) comparing the amount of p-galactosidase activity measured in step d) 
with an amount of p-galactosidase activity measured with a cell that has not been 
previously incubated in the presence of the candidate modulator, agonist, or 
antagonist compoimd for the polypeptide according to the invention. 

1 5 Another method of the present invention for identifying an agonist or 

antagonist of the ngn3 polypeptide is based on Applicant's discovery that ngn3 
polypeptide specifically activates its own promoter as shovra in the spinal cord and 
pancreas. Applicants have recognized that the expression of ngn3 is strictly 
dependent on the ngn3 protein. To date, no other members of the bHLH family 

2 0 have been found to activate or modulate the endogenous ngn3 promoter. 

According to the novel method in the present invention, the ngn3 responsive 
element located within the ngn3 promoter may be placed upstream of a reporter 
gene, for example, p-galactosidase, firefly luciferase, or green fluorescent protein, 
and be used as a specific marker to assay the level of activity of the ngn3 

2 5 polypeptide in the presence of an agonist or antagonist in a cell that expresses a 

known amount of ngn3 protein. The compound that modulates the endogenous 
activity of the ngn3 polypeptides will modify the expression of the reporter gene. 
The cell may be a primary culture cell or a cell line that either naturally or after 
transfecting with a nucleic acid encoding a ngn3 polypeptide according to the 

3 0 invention, expresses the polypeptide according to the invention. 
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In a specific embodiment, the polypeptide comprises an amino acid 
sequence of SEQ ID NO: 8 or SEQ ID NO: 10. 

According to a first aspect of the above screening methods, the cells used 
5 are cells naturally expressing a polypeptide according to the invention. Preferably, 
the cell expresses a polypeptide comprising an amino acid sequence of 
SEQ ID NO: 8 or SEQ ID NO: 10. The cells may be primary culture cells obtained 
ft^om a population of embryonic pancreatic epithelium and/or ventral spinal cord, 
for example. 

1 0 According to a second aspect, the cells used in the screening methods 

described above may be cells not naturally expressing, or alternatively expressing 
at a low level, a polypeptide according to the invention, said cells being transfected 
with a recombinant vector according to the invention capable of directing the 
expression of a nucleic acid encoding the polypeptide. Preferably, the cell 

1 5 expresses a polypeptide comprising an amino acid sequence of SEQ ID NO: 8 or 
SEQ ID NO: 10. 

According to a third aspect of the above screening methods, the polypeptide 
according to the invention may be a recombinantly produced polypeptide. 
Preferably, the recombinantly produced polypeptide comprises an amino acid 
2 0 sequence of SEQ ID NO: 8 or SEQ ID NO: 1 0. 

The present invention contemplates screens for small molecule ligands or 
ligand analogs and mimics, as well as screens for natural ligauds that bind to and 
agonize or antagonize a polypeptide of the invention in vivo. For example, natural 
products libraries can be screened using assays of the invention for molecules that 

2 5 agonize or antagonize the polypeptide's activity. 

Knowledge of the primary sequences of Relax and hngn3, and the similarity 
of these sequences with proteins of known fiinction, can provide an initial clue as 
to the inhibitors or antagonists of the polypeptides of the invention. Identification 
and screening of antagonists is further facilitated by determining structural features 

3 0 of the protein, e.g. , using X-ray crystallography, neutron diffi-action, nuclear 
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magnetic resonance spectrometry, and other techniques for structure determination. 
These techniques provide for the rational design or identification of agonists and 
antagonists. 

Another approach uses recombinant bacteriophage to produce large 
5 libraries. Using the "phage method" [Scott and Smith, 1990, Science 249:386-390 
(1990); Cwirla, et al, Proa Natl Acad, ScL, 87:6378-6382 (1990); Devlin et al, 

6 8 

Science^ 249:404-406 (1990)], very large libraries can be constructed (10 -10 
chemical entities). A second approach uses primarily chemical methods, of which 
the Geysen method [Geysen et al.. Molecular Immunology 23:709-715 (1986); 

1 0 Geysen et al. J. Immunologic Method 1 02:259-274 (1987)] and the method of 
Fodor et al. [Science 251:767-773 (1991)] are examples. Furka et al. [14th 
International Congress of Biochemistry, Volume 5, Abstract FR:013 (1988); Furka, 
InL J. Peptide Protein Res, 37:487-493 (1991)], Houghton [U.S. Patent No. 
4,631,21 1, issued December 1986] and Rutter et al. [U.S. Patent No. 5,010,175, 

15 issued April 23, 1991] describe methods to produce a mixture of peptides that can 
be tested as agonists or antagonists. 

In another aspect, synthetic libraries [Needels et al., Proc. Natl Acad, Sci 
USA 90:10700-4 (1993); Ohhneyer et al.. Proa Natl Acad Set USA 90:10922- 
10926 (1993); Lam et al., hitemational Patent Publication No. WO 92/00252; 

2 0 Kocis et al, Intemational Patent Publication No. WO 9428028, each of which is 
incorporated herein by reference in its entirety], and the like can be used to screen 
for ligands of the polypeptides according to the present invention. 

The screening can be performed with recombinant cells that express a 
polypeptide of the invention, or alternatively^ using purified protein, e.g., produced 

2 5 recombinantly, as described above. For example, labeled, soluble Relax or hngn3 

can be used to screen libraries, as described in the foregoing references. 

In one embodiment, a polypeptide of the invention may be directly labeled. 
In another embodiment, a labeled secondary reagent may be used to detect binding 
of the polypeptide of the invention to a molecule of interest, e.g,, a molecule 

3 0 attached to a solid phase support. Binding may be detected by in situ formation of 
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a chromophore by an enzyme label. Suitable enzymes include, but are not limited 
to, alkaline phosphatase and horseradish peroxidase. Jn a further embodiment, a 
two-color assay, using two chromogenic substrates with two enzyme labels on 
different acceptor molecules of interest, may be used. Cross-reactive and singly- 
5 reactive ligands may be identified with a two-color assay. 

Other labels for use in the invention include colored latex beads, magnetic 
beads, fluorescent labels (e.g., fluorescene isothiocyanate (FITC), phycoerythrin 
(PE), Texas red (TR), rhodamine, free or chelated lanthanide series salts, especially 
Eu^^, to name a few fluorophores), chemiluminescent molecules, radio-isotopes, or 

1 0 magnetic resonance imaging labels. Two color assays may be performed with two 
or more colored latex beads, or fluorophores that emit at different wavelengths. 
Labeled may be detected visually or by mechanical/optical means. 
Mechanical/optical means include fluorescence activated sorting, Le,, analogous to 
FACS, and micromanipulator removal means. 

15 As exemplified herein, the level of a polypeptide of the invention can be 

evaluated by metabolic labeling of the proteins. As the metabolic labeling occurs 
during in vitro incubation of the tissue biopsy in the presence of culture medium 
supplemented with [^^S]-methionine, the level of each of the markers detected may 
be affected by the in vitro conditions. In addition to metabolic (or biosynthetic) 

35 

2 0 labeling with [ S] -methionine, the invention fiirther contemplates labeling with 
[^"^CJ-amino acids and [^H]-amino acids (with the tritium substituted at non-labile 
positions). Thus, a sample or library of compounds can be directly analyzed after 
labeling of the proteins therein, e.g., by colorimetric staining using silver, gold, 
coomassie blue, or amido-schwartz, to mention a few techniques; isotopic labeling, 

32 125 131 

2 5 e.g., with [ P]-orthophosphate, [ I], [ I]; fluorescent or chemiluminescent tags; 

and immunological detection with labeled antibody or specific binding partner of a 
marker. 

The present invention may be better understood by reference to the 

3 0 following non-limiting Examples, which are provided as exemplary of the 
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invention. The examples and figures presented below by way of illustration and 
with no limitation being implied demonstrate other advantages and characteristics 
of the present invention. 

5 EXAMPLES 

General molecular biology techniques 

The methods traditionally used in molecular biology, such as preparative 
extractions of plasmid DNA, centriflxgation of plasmid DNA in a caesiimi chloride 

1 0 gradient, agarose or acrylamide gel electrophoresis, purification of DNA fragments 
by electroelution, protein extraction with phenol or phenol/chloroform, ethanol or 
isopropanol precipitation of DNA in a saline medium, transformation in 
Escherichia coli, and the like, are well known to a person skilled in the art and are 
amply described in the literature [Maniatis T. et aL, "Molecular Cloning, a 

15 Laboratory Manual", Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 
1982; (2"^ Ed. 1989); Ausubel F.M. et ah (eds), "Current Protocols in Molecular 
Biology", John Wiley & Sons, New York, 1987]. 

Conventional cloning vehicles include pBR322 and pUC type plasmids and 
phages of the Ml 3 series. These may be obtained commercially (Bethesda 

2 0 Research Laboratories) . 

For ligation, DNA fragments may be separated according to their size by 
agarose or acrylamide gel electrophoresis, extracted with phenol or with a 
phenol/chloroform mixture, precipitated with ethanol and then incubated in the 
presence of phage T4 DNA ligase (Biolabs) according to the supplier's 

2 5 recommendations. 

The filling in of 5' protruding ends maybe performed with the Klenow 
fragment of E. coli DNA polymerase I (Biolabs) according to the supplier's 
specifications. The destruction of 3' protruding ends is performed in the presence of 
phage T4 DNA polymerase (Biolabs) used according to the manufacturer's 

3 0 recommendations. The destruction of 5' protruding ends is performed by a 
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controlled treatment with SI nuclease. 

Mutagenesis directed in vitro by synthetic oligodeoxynucleotides may be 
performed according to the method developed by Taylor et al. [Nucleic Acids Res. 
13 (1985) 8749-8764] using the kit distributed by Amersham. 
5 The enzymatic amplification of DNA fragments by PCR [Polymerase- 

catalyzed Chain Reaction, Saiki R.K. et al., Science 230 (1985) 1350-1354; MuUis 
K.B. and Faloona F.A., Meth. Enzym. 155 (1987) 335-350] technique may be 
performed using a "DNA thermal cycler" (Perkin Elmer Cetus) according to the 
manufacturer's specifications. 
1 0 Verification of nucleotide sequences may be performed by the method 

developed by Sanger et al, [Proc. Natl. Acad. Sci. USA, 74 (1977) 5463-5467] 
using the kit distributed by Amersham. 

Plasmid DNAs may be purified by the Qiagen Plasmid Purification System 
according to the manufacture's instruction. 

15 

EXAMPLE 1 : Direct ampUfication by PCR of the sequences between the basic 
domains and Helix II with the aid of degenerate oligonucleotides. 

The isolation of the cDNA encoding new transcription factors of the bHLH 
family is carried out using rat SCG (Rat Superior Cervical Ganglia). Given that the 
2 0 proteins of this family have highly conserved amino acid units, in particular a basic 
domain followed by Helix I and Helix 2, degenerate ohgonucleotides encoding 
these various conserved regions are chosen. They were therefore selected so as to 
conserve the basic regions and the Helix n domain of the complex Drosophilia 
achaete-scute and the bHLH mammalian genes such as Mash-1, Mash-2 and M- 

2 5 Twist. Such oligonucleotides have been used respectively as sense and anti-sense 

primers on the one hand, to carry out direct screening of an SCG library 
constructed by PCR, and on the other hand in order to carry out PCRs on SCG ss- 
cDNAs. 

The sense oligonucleotide was chosen from the basic domain of these 

3 0 bHLH proteins; it is more precisely the oligonucleotide corresponding to the 
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following polypeptide fragment: 5'-AATKHGMGIGAGCGCIDKCGCRYG-3' 
(SEQIDN0:2). 

The anti-sense oligonucleotide, for its part, was determined from the Helix 
n domain of the same bHLH proteins. It is the oligonucleotide corresponding to the 
5 following polypeptide fragment: 5'-GGCSRDTYTCAGGGTSYBGAYCTT-3^ 
(SEQ ID NO: 3). 

The single-stranded cDNA (ss-cDNA) is prepared from 500 ng of 
denatured polyA^ RNA derived from the SCG of two-day-old rats. 

The PGR ampUfication is carried out on a mixture comprising, in addition 

10 to the PGR buffer (10 mM Tris, pH 8.3, 50 mM Kcl, 2.5 mM MgC12), 200 

dNTP, 12.5 pM of each ohgonucleotide and about 20 ng of single-stranded (ss)- 
cDNA, The PGR is carried out by performing successively 3 minutes of 
denaturation at 94^C; 2 cycles [94°C for 30 seconds(sec); 50°C for 45 sec; 72°c for 
1.5 minutes(min)] followed by 38 cycles (94°C for 30 sec; 62°C for 45 sec; 72°C 

15 for 1.5 min). The PGR products are cloned at the Smal site of pUG19 and the 1500 
clones thus obtained analyzed by sequencing. Applicants searched for the presence 
of a Helix I unit in the amino acid sequence deduced from the nucleotide sequence 
of these various cDNAs. A large proportion of these cDNAs (670) contain such a 
unit J and among them, only one corresponds to a new possible member of the 

2 0 bHLH family. The other cDNAs isolated correspond to known bHLH proteins. 

The new isolated cDNA, called CIG235 and represented in Figure 1, 
corresponds to a fragment of 76 nucleotides (excluding the sequences 
corresponding to the oligonucleotides introduced by PGR). The ahgnments of 
amino acid sequences were performed by authorizing neither insertions nor 

2 5 deletions. These alignments show that this fragment encodes a unit homologous to 
the Helix I unit of the bHLH proteins. The isolation of the complete cDNA was 
then undertaken. 



30 



EXAMPLE 2: Isolation of the complete cDNA corresponding to CIG235. 



The isolation of the complete cDNA corresponding to CIG235 was 
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extremely difficult because the mRNA corresponding to CIG235 is very little 
expressed in the SCG. It even appears that its expression level is close to that of the 
illegitimate transcription. The technique selected consists, on the one hand, in 
isolating the 3' end of the cDNA corresponding to CIG235 by anchored PGR and, 
5 on the other hand, in isolating the 5' end of this cDNA from the spinal cord of rat 
embryos at the El 2.5 stage. 

L Extension of the cDNA CIG235 in 3' by anchored PCR,. 

For that, SCG ss-cDNA were prepared with the RA3' NV primer (random 
1 0 reverse transcription primer carrying the 3' NV anchor) according to the Dumas 
Mihae Edwards et al. method (1995, A Practical Approach Ed McPherson et aL, 
pp89-118, IRL Press Oxford U.K.). Two specific nested primers are chosen in the 
bHLH positive clone of 76 nucleotides: CIG5'-1: 5'- 

AACCTTAACTCCGCGCTGGATGCGC-3' (SEQ E) NO: 4) and CIG5'-2: 5^- 

1 5 CGCGGTGTCCTGCCCACC-3' (SEQ DD NO: 5). 

Applicants then amplified the 3' end of CIG235 by two nested PCRs with 
the pairs of oHgonucleotides CIG5'1 and A3'-l and then CIG5'2 and A3*-2. The 
PGR mixture identified in Example 1 is used with only 0.8 [jM of each of the 
oligonucleotides and a denaturation step at 94*^C for 30 sec followed by an 

2 0 extension step at 72°C for 1 .5 min. The "annealing" is carried out for 45 sec with 
variations in temperature. During the first ten cycles, this temperature is reduced by 
1°C every two cycles fi*om 70°C to 65°C and then maintained at 65^C during 30 
cycles. The resulting products are cloned into pUC19 and sequenced. Applicants 
thus obtained a cDNA of 717 nucleotides, called CIG3', whose 5* part corresponds 

2 5 to the 3' end of the clone CIG235. Analysis of the open reading frames shows that 

the entire sequence encoding the carboxy-terminal end of the protein was isolated 
with 321 nucleotides of noncoding 3' region. 

2. Isolation of the 5' end corresponding to CIGS^from the spinal cord of rat 

3 0 embryos at the El 2, 5 stage. 
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Applicants sought out by in situ hybridization the pattern of expression of 
the mRNA corresponding to CIG3'. Its expression is estimated at various stages of 
the embryonic development. The method chosen is that of in situ hybridization on 
thick rat embiyo sections with a digoxygenin-labelled probe. This method is in fact 
5 an adaptation of that described for in toto hybridization (1994, A Practical 
Approach, Ed Wilkinson et al., pp75-83, IRL Press Oxford U.K.), and makes it 
possible to rapidly screen whole embryos at numerous different stages, 

A high expression of this mRNA was detected in the spinal cord, at 
embryonic day 12.5 (E12.5). Applicants therefore prepared mRNAs from spinal 
1 0 cord at this stage of development, and carried out the isolation of the 5' end by the 
sue method. A PGR product of 893 nucleotides is isolated and called CIG5\ 

The CIG3' and CIG5* cDNAs overlap over a region of 120 nucleotides and 
form a total cDNA whose size is 1491 nucleotides. 

However, given that CIG3* and CIG5' were not isolated from the same 
1 5 tissue, it was necessary to show that the contig generated by the alignment of these 
two clones corresponded to an existing mRNA and not to a chimera generated by 
PGR. Applicants therefore verified by PGR, after reverse transcription, that the 
total mRNA is present in the spinal cord at E12.5. hi this tissue, a single PGR 
amplification makes it possible to visuahze this mRNA in its entirety. Furthermore, 
2 0 analysis of the mRNAs by Northern blotting shows that all three of the probes 

GIG3', GIGS' and total cDNA make it possible to detect a single mRNA of 3.1 kb. 
Finally, in situ hybridization, the spatiotemporal expression pattern obtained with 
CIG3' or with the total cDNA is absolutely identical. These three results allow us to 
assert that GIG3* and GIGS* indeed correspond to two fragments of a same mRNA. 

2 5 In order to dispense with the point mutations introduced by Taq 

polymerase. Applicants carried out, on ss-cDNAs of the spinal cord at E12.5, four 
independent amplifications of the entire cDNA. Applicants were thus able to 
accurately determine the sequence of this cDNA that is represented in SEQ ID NO: 
1. 

3 0 Analysis of the cDNA sequence shows that Applicants isolated the entire 
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coding region, the noncoding 5' region as well as part of the untranslated 3' region. 
Furthermore, the expression of the corresponding mRNA is detected in the embryo 
in the form of longitudinal bands. This characteristic led to this cDNA being 
designated Relax^ for Rat embryonic longitudinal axis. 

5 

EXAMPLE 3: Comparison of amino acid sequences of the bHLH domain of Relax 
with other proteins of the BHLH family. 

Analysis of the amino acid sequence deduced from the Relax cDNA clearly 
indicates that this protein is part of the bHLH family. The bHLH domain of Relax 

1 0 shares 68% identity with the bHLH unit of the protein NeuroD (Lee et aL, 1 995). 
Furthermore, it appears that Relax shares more sequence identities with proteins 
homologous to (Math-1, Akazawa et al., 1995) than with those homologous to AS- 
C (Mash-1, Johnson et aL, 1990) (Figure 2). However, the total absence of 
sequence identity, outside the bHLH domain, with the other members of the family 

1 5 reveals that Relax is not the paralogue of an already identified protein. For this 
reason. Relax probably corresponds to the first member of a new subfamily of 
bHLH proteins. 

EXAMPLE 4: Characterization of the transcriptional activity of the Relax protein. 
2 0 The amino acid sequence analyses indicate that Relax shares a common 

structure with proteins of the bHLH family. These proteins are transcription 
factors. It is therefore important to demonstrate that the Relax cDNA encodes a 
protein whose characteristics are those of a transcription factor, that is to say that 
Relax is capable of binding to DNA on the target sites of bHLH factors, and that it 

2 5 can regulate transcription. 

1 . Proteins of the bHLH family form homo- or heterodimers and bind 
specifically to DNA (Murre et al, 1989, Cell 558, 537-44). 

3 0 The in vitro characterization of the DNA binding sites has allowed the 
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identification of a consensus hexanucleotide sequence, CANNTG, called E box 
(Murre et al., 1989). Applicants have shown, by gel migration retardation 
experiments, that the purified recombinant Relax protein is capable of binding 
specifically to a sequence containing an E box. To do this, the DNA encoding the 
5 Relax protein is inserted between the Ndel and Kpnl sites into a vector pFLAG- 
CTC. After expression of the protein in Escherichia coli, the fusion protein Relax- 
Flag is purified on an affinity colunm and then used in a binding reaction in the 
presence of an oligonucleotide containing an E box. 

10 2. The transcriptional activity of the Relax protein, for its part, was tested by 

cotransfection experiments in the cell line PCI 2, derived from rat 

pheochromocytomas. 

For that, Applicants used a luciferase reporter gene placed xmder the 

control, on the one hand, of a promoter containing an E box (5 kb of the TH 
1 5 promoter) and, on the other hand, of the same promoter containing a mutated E 

box. 

In parallel, Applicants prepared a vector for expressing the Relax protein as 
follows: 

(a) The cDNA encoding the Relax protein (firom position 389 to 1235) is 
2 0 inserted between the EcoRI and Xhol sites into the expression vector pcDNA3 

downstream of the CMV promoter. The rat TH promoter carrying an E box 
CAGGTG (SEQ ID NO: 6) at position 197 on a 5 kbp firagment is inserted into the 
blunt ended Hindin site in the plasmid pKSLuc which becomes 5kbTH-Luc. The 
site is mutated into TCCGTG (SEQ ID NO: 7) and the resulting plasmid 

2 5 designated 5kbTH(Delta E)-Luc. 

(b) The plasmid pcDNA3-Relax is cotransfected in various quantities 
ranging firom 0 to 5 pmol, and brought to 5 pmol with empty vector pcDNA3, in 
PC 12 cells (10^ cells/plate) mixed with 1 pmol of plasmid 5kbTH-Luc, 5kb- 
TH(deltaE)-Luc or pKSLuc. To evaluate the transfection efficiency, 0,2 pmol of a 

3 0 vector for expressing the gene encoding the CAT (chloramphenicol 
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acetyltransferase) protence under the control of the RSV promoter are also 
cotransfected. 

Measurement of the luciferase activity, after cotransfection of these vectors 
with increasing quantities of a vector for expressing the Relax protein, 
5 demonstrates that Relax activates the transcription of the reporter gene depending 
on the quantity of expression vector used and that this activation strictly requires 
the presence of an intact E box, 

EXAMPLE 5: Characterization of the sites for expression of the Relax mRNA 

10 during embryonic development. 

Applicants systematically analysed, by in situ hybridization, the distribution 
of the Relax mRNA in rats during embryonic development. To do this, in situ 
hybridizations in the rat embryo are carried out according to the Wilkinson method 
with anti-sense riboprobes either in toto or on sections of 250 jxm. The riboprobes 

15 are prepared in the presence of 1 g of linearized plasmid and in the presence of 3 .5 
nmol of digoxygenin-l 1-UTP using the Promega riboprobe kit. This detailed 
analysis shows that the expression of the Relax mRNA is transitory and spatially 
restricted. 

This mRNA is only present in the embryo, between days 11.5 and 1 8.5 of 
2 0 development. Its expression is restricted to the central nervous system. Indeed, no 
expression was detected, either in the peripheral nervous system, or outside of the 
nervous system. Inside the CNS, the Relax mRNA is expressed only in the spinal 
cord, the posterior brain and the anterior brain. 

More precisely, the Relax mRNA is expressed in longitudinal 

2 5 compartments in the spinal cord and the posterior brain. 

Its distribution in the spinal cord is restricted therein to two continuous 
bands of bilateral symmetry, longitudinally organized and restricted, on the DA^ 
axis, to the basal lamina, one at its ventral end, the other at its dorsal end. 

In the anterior brain, the Relax mRNAs are located in two groups of cells 

3 0 situated on either side of the optic recess, in the ventral region of the hypothalamus 
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and in the preoptic area. The predominant group of cells expressing Relax is 
situated in the basal part. Only the few cells of the preoptic area are situated in the 
wing region. The situation in the anterior brain is similar to that in the spinal cord 
and the posterior brain, hideed, the cells expressing Relax are, in all cases, strictly 
5 located in the ventricular zone. 

EXAMPLE 6: Characterization of the moments of expression of the Relax mRNA 
during embryonic development. 

All the sites of expression of Relax simultaneously belong to El 1 .5. They 

1 0 disappear progressively from the spinal cord between El 6.5 and El 8.5, according 
to a ventrodorsal gradient that follows the progressive reduction of the ventricular 
zone, hi the posterior brain, the expression of Relax sv^tches off, at the same time 
as that of the median band in the spinal cord. Finally, in the anterior brain, the 
expression of Relax disappears after El 8.5. It is important to underline that the 

1 5 expression of Relax is confined to the ventricular zone and that the temporal 
window for expression coincides with that for the production of the neurons in 
these different structures. 

EXAMPLE 7: Isolation of a new cDNA encoding the human neurogenin 3 
(hnsnS) homolog. 

Applicants have shown that the overexpression of Relax, the rat homolog 
of ngn3^ mXenopus embryo, induces the ectopic production of neurons 
throughout the ectoderm, thereby demonstrating that ngn3 has a neuronal 
determination function (Examples 1-6 supra and ^^). Here, Applicants report the 
isolation of the human neurogenin 3 homolog (hngnS). 

PCR primers: All PCR primers were chosen with the Oligo 4.0 software 
program. The nucleotide position of all the primers listed below is given relative 
to the genomic hngn3 sequence (SEQ ID NO: 9). 

ss-cDNA synthesis: Randomly primed ss-cDNAs were synthesized using total 
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RNA extracted from the spinal cord of a 7- week-old human embryo. Total RNA 
was prepared with RNAble (Eurobio) based on a guanidine/phenol extraction 

27 

procedure described by Chomczynski and Saachi . The ss-cDNAs were 
synthesized from 2|iig of denatured total RNA in the following reaction mix: 0.4 
jaM random hexamer, 2.5 mM dNTP, 4mM MgCl2, 10 mM Tris, pH 9, 50 mM 
KCl, 0.1% Triton X-100, 0.01% gelatin, 10 mM DTT, 8 U RNasin and 2.4 U 
AMV RTase. The mixture was incubated at 42^C for 1 hour. 

Amplification of the hngn3 bHLH domain: PCR was performed using two 
specific primers deduced from the nucleotide sequences of Relax and ngn3 
corresponding to the basic domain: hngnSB (nucleotides 585-602) 5'- 
CAACGACCGCGAGCGCAA-3' (SEQ ID NO: 1 1) and to the helix H domain: 
hngnSHII (nucleotides 722-699) 5'-GCCCAGATGTAGTTGTGGGCGAAG-3' 
(SEQ ID NO: 12). The PCR mix consisted of 10 mM Tris pH 9, 50 mM KCl, 
0.1% Triton X-100, 0.01% gelatin, 1.5 mM MgCh, 0.8 jaM of both primers, 200 
jiM dNTP, 2.5 U Taq polymerase and one tenth of the previously synthesized ss- 
cDNAs. Amplification conditions were: 94''C for 3min., 60°C for 2min., 72''C 
for 30sec. then 29 cycles of 94°C for 30sec., 60°C for 45sec., 72°C for 30sec. 
The PCR products were subcloned using the pmosBlue PCR cloning system 
(Amersham) and sequenced. 

3' anchored PCR: Randomly primed ss-cDNAs were synthesized as above with 
a different random primer (RA3'-NV; 5'-TCGTTGAGACTCGTACCAGCAGA 
GTCACGAGAGAGACTACACGGTACTGGNNNNNNNNN-3'; SEQ ID NO: 
13) to make it possible to carry out the 3' extension experiment described by 
Ravassard et al. . Two specific nested primers were designed based upon the 
sequence of the 138 bp fragment corresponding to the bHLH domain of hngnS, 
hngn35'-l (nucleotides 663-683): 5'-AGACGACGCGAAGCTCACCA-3' 
(SEQ ID NO: 14) and hngn35'-2 (nucleotides 675-693): 5'- 
GCTCACCAAGATCGAGACGCTGCG-3' (SEQ ID NO: 15). Applicants 
obtained the 3 '-extended cDNA by performing two rounds of nested PCR, one 
with A3'-l : 5'-ATCGTTGAGACTCGTACCAGCAGAG-3' (SEQ ID NO: 16) 
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and hngn35'-l, and the other with A3'-2: 5'- 

TCGTACCAGCAGAGTCACGAGAGAG-3' (SEQ ID NO: 17) and hngn35'-2. 
Applicants used the reaction mix described above. Amplification conditions 
were: 94^C for 3mm., 58°C for 45sec., 72°C for lmin.30 sec, then 34 cycles of 
5 94''C for 30sec., 58°C for 45sec., 72°C for lmin.30 sec. The PGR products were 
subcloned using the pmosBlue PGR cloning system and sequenced. 

5' anchored PGR or SLIC strategy (single stranded ligation of cDNA): Specific 
ss-cDNA was synthesized from himian embryonic spinal cord RNA with an 
antisense primer based on the 3 '-extended cDNA sequence: hngn3RT 
10 (nucleotides 863-845) 5'-CTGCCAGCCTGGGAGACTG-3' (SEQ ID NO: 18). 
Single-stranded DNA was ligated with the A5'-NV: 5 - 
CTGCATCTATCTAATGCTCCTCTCGCTACCTGCTCA 
CTCTGCGTGACATC-3* (SEQ ID NO: 19) as described by Dumas Milne 

29 

Edwards et aL . Two rounds of nested PGR were performed with the first SLIC 
15 primer A5'-l : 5'-GATGTCACGCAGAGTGAGCAGGTAG-3' (SEQ ID NO: 
20) and the specific primer hngn33'-l: (nucleotides 858-836) 5'- 
AGCCTGGGAGACTGGGGAGTAGA-3' (SEQ ID NO: 21), then with the 
second SLIC primer A5'-2: 5'-AGAGTGAGCAGGTAGCG AGAGGAG-3' 
(SEQ ID NO: 22) and the specific primer hngn33'-2: (nucleotides 747-726) 5'- 
2 0 CGCTATGCGCAGCGTTTGAGTC-3 ' (SEQ ID NO: 23). Both sets of PGR 
conditions were identical to those used for 3 '-anchored PGR. The PGR products 
were subcloned using the pGemT easy PCR cloning system (Promega) and 
sequenced. 

One of the main expression sites for Relax/ngnS is the developing spinal 

2 5 cord in rat and mouse^^' Applicants have isolated the human homolog of 

rodent ngn3 by producing a single-stranded cDNA (ss-cDNA) from mRNA 
extracted from the spinal cord of a 7-week old human embryo, PCR 
amplification was performed on the ss-cDNA using primers based on the basic 
and helix 11 regions of the Relax/ngnS sequences. A 138 bp product was 

3 0 amplified (nucleotides 585-722 of SEQ ID NO: 9; see Figure 3). The sequence 
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of the cloned PGR product was 91.2% and 87.5% identical to the nucleotide 
sequences of ngn3 and Relax respectively. It was 89.1% identical to the ngnl 
and ngn2 sequences. Thus, the isolated PGR fragment is the human homolog of 
a neurogenin, most probably Relax/ngn3. 

5 To obtain the full-length coding sequence corresponding to the 138 bp 

PGR fragment, 3'-anchored PGR on human embryonic spinal cord ss-cDNA was 
performed. A 655 bp PGR fragment was isolated, corresponding to a 3' 
extension of the original cDNA clone (nucleotides 675-1330 of SEQ ID NO: 9; 
see Figwe 3). Open reading frame analysis showed that the entire C-terminal 

1 0 region (nucleotides 675-963 of SEQ ED NO: 9) and 366 nucleotides of the 3' 
imtranslated region (nucleotides 964-1330 of SEQ ID NO: 9) had been cloned 
(Figure 3). Applicants obtained the 5' end of this cDNA, by performing 5'- 
anchored PGR by the SLIG method. Applicants ampUfied a 747 bp PGR 
fragment (nucleotides 1-747 of SEQ ID NO: 9) that overlapped the 3 '-extended 

1 5 cDNA clone over 73 bp (Figure 3). The two clones were obtained by 
independent PGR so it was necessary to demonstrate that the contiguous 
sequence corresponded to a real mRNA and not to a chimera. Applicants 
demonstrated by reverse transcription-mediated PGR using PGR primers of SEQ 
ID NO: 24 and SEQ ID NO: 25 that an 1 166 bp transcript is present in the 

2 0 human embryonic spinal cord^ the sequence of which indeed corresponded to the 
alignment of the 3' and 5' fragments (nucleotides 1-157 and nucleotides 321- 
1 330 of SEQ ID NO: 9). Nucleotides 1 58-320 of SEQ ID NO: 9 comprise a 5'- 
UTR interrupting intron identified by amplification performed on genomic DNA 
described below. Applicants completely sequenced both strands of four 

2 5 independent PGR products to rule out mutations induced by PGR. The 

nucleotide sequence was analyzed and the cloned cDNA was found to belong to 
the neurogenin family, and to be most closely related to Relax/ngn3, Applicants 
therefore named this cDNA human neurogenin 3 (hngn3). 

The hngnS cDNA comprises an open reading frame downstream from an 

3 0 in-frame methionine (Figure 3). This hngn3 open reading frame is predicted to 
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encode a 214 amino acid polypeptide (SEQ ID NO: 10). Alignment of the 
predicted amino-acid sequence encoded by hngnS with the sequences of the 
mice neurogenins. Relax (rat ngn3) and the human ngnl homolog (neuroDS) 
showed that most of the residues of the hngn3 protein were conserved in its 
5 murine counterparts. Indeed, hngn3 shares 78.0% and 77.1% identity with Relax 
and ngn3 respectively whereas only 48%, 46%, and 44% identity is found for 
neuroD3, ngnl and ngn2 respectively (Figure 4). 

The genomic sequences of the three mice neurogenin genes and that of the 
Mas hi gene have been analyzed and the entire coding sequences has been shovm 

1 8 20 

10 to be contained within a single exon ' . Applicants therefore analyzed the 
structure of the hngnS gene by PCR using the set of primers (SEQ ID NO: 24 
and SE ED NO: 25) that yielded the 1 166 bp fragment in the amplification of 
spinal cord cDNA. Interestingly, the PCR product obtained from genomic DNA 
was 1330 bp long and comprised the polynucleotide sequence as depicted in 

15 SEQ ID NO: 9. Sequence analysis showed that the coding sequence was 

contained within a single exon and that there was a 163 bp intron interrupting 
the 5' untranslated region (nucleotides 158-320 of SEQ ID NO: 9); see Figure 
3), The hngn3 nucleic acid and hngn3 polypeptide sequence information has 
been deposited within the GenBank/EMBL database (Accession number : 

20 AJ133776). 

EXAMPLE 8: PhyloRenetic analysis of the hngn3 related sequences. 

In this Example, Applicants carried out phylogenetic analysis of the 
relationships between the bHLH factors related to hngnS and found that the 
2 5 neurogenin genes were homologous to the Drosophila Tap gene and did not 
cluster with the atonal family. This study also shows that a great number of 
bHLH genes involved in neurogenesis are clearly misnamed. 

Optimal alignments of the sequences of the bHLH domains of the proteins 
most closely related to the protein encoded by our putative human neurogeninS 
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clone were subjected to phylogenetic analysis with distance methods. The tree 
obtained was rooted on the Drosophila Daughterless protein sequence (Figure 
5a), Five groups were consistently identified, the branching being well supported 
by the bootstrap values. The vertebrate neurogenin group has the product of the 
5 Drosophila tap gene as homolog, and is raost closely related to the neuroD 
group, for which no invertebrate homolog has yet been identified. The atonal- 
related group comprises the nematode lin-32 protein and all the athl proteins. 
These three groups are more distantly related to other sequences of a group 
0 consisting of the twisty scler axis ^ par axis, E-hand and D-hand sequences and to 

B 1 0 the group of the achaete-scute complex. Alignments of whole sequences could 

: _|: f 

only be used for the neurogenin-related sequences (Figure 5b), and made it 
possible to depict three paralogous groups in vertebrates, corresponding to ngnl, 
ngn2 and ngn3 in the mouse. According to this classification, the human gene 
^ designated neuroD3 is in fact orthologous to the other ngnl vertebrate 

^ 1 5 sequences, and the Xenopus xngnrla gene is orthologous to ngn2. 

r: A few key conclusions may be drawn fi-om the phylogenetic analysis. First, 

various paralogous groups of bHLH proteins were unambiguously defined, 
thereby helping to clarify the relationships within this transcription factor family 
(Figure 4a). All groups except neuroD contained vertebrate and Bilateria 
2 0 representatives (atonal and athl proteins, tap and ngn factors, AS-C-related 

proteins, twist-related factors). Drosophila atonal is commonly thought to be the 
archetype of this group, but the phylogenetic analysis does not support this 
contention. Atonal does belong to the athl group, but it is only distantly related 
to the neuroD and tap/ngn groups. It is not clear whether neuroD really has no 

2 5 Drosophila homolog or whether such a homolog has not yet been found. Four of 

these groups correspond to proneural or neural specification genes, and it may be 
that the ancestor genes of this group also had a proneural function that would 
impose some constraints on the structiu-e-function relationships of the related 
proteins. Finally, three paralogous proteins belong to the vertebrate ngn 

3 0 subfamily (ngnl, ngn2, and ngn3), although only one is found in metazoans. 
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which diverged much earlier during evolution such as Drosophila. This situation 
is reminiscent of what has been observed for many other family of genes 
expressed in the CNS, including homeobox proteins, growth factors, enzymes or 

21 22 

neurotransmitter receptors ' . Thus, as in the other gene families, the vertebrate 
5 ngn paralogous genes may have been generated by two gene duplications steps 
that occurred before the emergence of cartilaginous fish and, therefore, all 
vertebrate groups of species may have at least three representatives of the ngn 
subfamily of bHLH transcription factors. Finally, a phylogenetic approach 
should make it possible to unify the system of nomenclature of these proteins, 
0 1 0 some being clearly misnamed, such as Xenopus ngnrl a, human neuroD3, and all 

H the ath proteins (-ath2 to 5) with the exception of the athl family. 

3 EXAMPLE 9: Chromosome mapping of the hngnS gene. 

i=f Applicants used radiation hybrid mapping, to map the hngnS gene to 

I □ 1 5 chromosome 1 0q2 1.3, with a linkage placement confidence of "lodscore> 1 5". 

Applicants determined the chromosomal location of the hngnS gene by screening 
by PGR a panel of human/hamster radiation hybrids (Genebridge 4). Applicants 
first checked that our PGR conditions resulted in the amplification of a single 
1 .3 kbp firagment from human genomic DNA whereas no PGR product was 
2 0 obtained fi*om hamster genomic DNA. Applicants then amplified the 1 .3 kbp 
fi-agment fi^om the genomic DNA of the 93 radiation hybrids. Applicants 
checked the specificity of the amplification by Southern blot analysis of the 
resulting PGR products using an internal hngnS probe. 

All PGR primers were chosen with the Oligo 4.0 software program. The 
2 5 nucleotide position of all the primers fisted below and the hybridization probe 
are given relative to the genomic hngnS sequence (SEQ ID NO: 9). The 
following primers were used to screen by PGR the Genebridge 4 radiation hybrid 
panel: hngnSS (nucleotides 1-25): 5'-CCTGGGAGGCGATTGTGTGTTGTTT- 
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3' (SEQ ID NO: 24); hngnSAS (nucleotides 1330-1307): 5'- 
TGAGTGAGGGTAGGGCGACCCAAG-3' (SEQ ID NO: 25), 

PGR was performed with 25ng of genomic DNA from each radiation 
hybrid in a final reaction volume of 25 ^1 containing 0.8 |j.M of each primer, 200 
I^M dNTP, 1.5 mM MgCl2, 10 mM Tris, pH 9, 50 mM KCl, 0.1% Triton X-100, 
0,01% gelatin and 2 U Taq polymerase. Amplification conditions were: for 
3min., 60°C for 2min., 72'^C for lmin,30sec, then 29 cycles of 94°C for 30sec., 
60°C for 45sec., 72'^C for lmin30sec. 

The PGR products were loaded onto a 1% agarose gel and subjected to 
electrophoresis. The DNA was then transferred to a HybondN+ membrane 
(Amersham) under alkaline conditions and probed under standard conditions 
32 

with a P-labeled hngnS internal probe: (nucleotides 496-510) 5'- 
AGGAAGCTCCGGGCA-3' (SEQ ID NO: 26). The membrane was washed 
several times and autoradiographed. A specific 1.3 kbp fragment was detected in 
24 radiation hybrids of the 93 hybrids in the panel. The PGR results were 
processed using the Whitehead Institute Radiation Hybrid Mapping software, 
allowing only lod scores > 15 as minimal placement confidence. 

Precise chromosomal mapping was performed by linkage analysis using 
the Whitehead Institute radiation hybrid mapper. Applicants mapped the hngn3 
gene to chromosome 10 at 2.74cR from Dl OS 1606 (WI-5300) and 2.53cR from 
D10S1418 (CHLC.GATA101E02) with a placement confidence of "lod 
score>15". Both markers have been mapped to 10q21.3 mid according to the 
Whitehead Institute radiation hybrid map of chromosome 10 they are 3cR apart. 
Thus, the hngnS gene maps to the region between the Dl OS 1606 and Dl OS 141 8 
markers on the long arm of chromosome 10, and more precisely in position 
10q21.3 (Figure 6). 

The hngn3 gene mapped to the long arm of chromosome 10, more 
precisely to 10q21 .3. hngnS maps to the region encompassing annexin VII 
{ANX7) to CaM kinase II gamma (CAMKG) and also to that encompassing 
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ankyrin 3 (ANK3) to prosaponin (PSAP), The latter genes have been mapped in 
either human or mice. In mouse, the ANX7 and CAMKG genes have been 
mapped in mouse to chromosome 14 v^hereas the ANK3 and PSAP genes have 
been mapped to chromosome 10. Thus, the ngn3 gene is probably located in 
5 mouse either on chromosome 10 or 14. 



EXAMPLE 10: in situ localization of hnsn3 expression in the human 
developing embryo. 

In the developing himian embryo. Applicants showed by in situ 
1 0 hybridization, that the hngnS gene is expressed in the spinal cord and pancreas. 
Furthermore, the expression pattern of hngn3 was identical to that from ngn3 
genes in rat and mouse embryos suggesting that the function of ngn3 genes is 
conserved in mammals. 

Human embryos were obtained from legal abortions (Hopital R. Debre, 
15 Professors P. Blot and J. F. Oury, Paris) according to the guidelines of the 
French Ethical Committee. The embryos were aged from 6 to 7 weeks post- 
conception as determined by crown-to-rump length measured by ultrasound 
scanning and the Carnegie stage was evaluated according to the general 
morphology. Embryos were fixed in 4% paraformaldehyde overnight, 
2 0 cryoprotected 48 hours in IX Phosphate buffered saline (PBS) containing 1 5% 
sucrose, embedded in 7% gelatin prepared in IX PBS with 15% sucrose and 
frozen in isopentane. 

14 |im serial transverse sections were performed and probed with either 
the antisense (SEQ ID NO: 27) or the sense (SEQ ID NO: 28) digoxygenin-1 1- 
2 5 UTP labeled hngn3 riboprobes according to the modified whole mount in situ 

30 

hybridization protocol described by Wilkinson . Negative controls with sense 
probes did not show any signals. 

Transverse sections through human embryos at Carnegie stage 18 
(six and a half weeks) were probed with an hngn3 antisense riboprobe. No signal 
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was detected in the negative controls, probed with sense riboprobes (Figure 7b). 
^ngTi^-positive cells were found in the spinal cord and in the pancreas (Figure 

18 

7a, 7c). As previously described for ngnS in the mouse embryo , hngn3 is 
expressed in scattered cells throughout the pancreas (Figure 7d). These cells 

18 

5 correspond, in the mouse embryo, to the precursors of the islets of Langherans . 
hngn3 was also expressed in the ventral region of the spinal cord (Figure 7a, 7e- 
h), in two longitudinal stripes located at two dorso-ventral positions. The first 
one is strictly adjacent to the floor plate (Figure 7a, 7e and 7g) and the second, 
with a weaker intensity of staining is located around the midpoint between the 

1 0 floor plate and the roof plate (Figure 7a, 7e and 7h). Phase contrast microscopy 
clearly demonstrated the presence of hngnS-Qxpressing cells inside the 
ventricular zone of the neural tube (Figure If). In the most ventral stripe, hngn3- 
positive cells are present near the limien of the neural tube and in the lateral part 
of the ventricular zone (Figure 7g). hi the medial stripe, most of the hngnS- 

1 5 expressing cells are located near the marginal border of the ventricular zone 
(Figure 7h). This pattern is very similar to that for Relax and ngnS in the spinal 
cord of developing rat and mouse embryos^ ^' Relax and ngn3 are also 
expressed in the hindbrain and in the ventral region of the forebrain close to the 
optic recess^^' Due to limitations inherent in the use of human embryos, all the 

2 0 samples collected were extensively damaged at the anterior part. Thus, 
histological analysis was not possible for the hindbrain or the forebrain. 
Nevertheless, Applicants found that hngnS was expressed in the same 
compartments as Relax and ngn3 in the pancreas and the spinal cord. Our results 
suggest a conservation of function for the neurogenin 3 genes of humans, mice 

2 5 and rats. 

The expression pattem of hngn3 in the spinal cord is identical to that of 
Relax in rats and ngn3 in mice^^' In the pancreas, hngn3 expressing cells 

18 

were found in clusters similar to those described in the mouse pancreas . 
Several lines of evidence support the idea that ngn3 governs the differentiation 

3 0 of the endocrine pancreas. Firstly, Sommer et al}^ have shown that the 
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expression of ngnS and neuroD is coordinated both in space and time suggesting 
that ngn3 controls neuroD expression. Secondly, mice carrying a null mutation 
in the neuroD gene have severe diabetes^^. Mutant mice have far fewer insulin- 
producing 6-cells and do not develop mature islets. Finally, it is most likely that 
5 ngn3 initiates the differentiation of pancreatic B-cells by controlling the 

23 

expression of neuroD and the subsequent production of insulin . More recent 
data has demonstrated that ngn3 is the gene that controls the entire 
differentiation of the endocrine pancreas. Analysis of ngn mutant mice has 
shown that these mice lack the four pancreatic endocrine cell types (including 

3 1 

1 0 the insulin producing cells). In addition, it has been demonstrated that the 
overexpression of ngn3 in the developing pancreas yields an increased number 

32 

of endocrine cells. Thus^ any mutations in the hngnS gene could lead to severe 
diabetes. The hngnS gene has been mapped to the 10q21.3 position, hx this 
region, no genes for inheritable diseases with phenotypes consistent with a 

15 hngnS-altcred function have been identified. The nxmierous studies performed on 
the genetic predisposition to type 1 diabetes mellitus have shown polygenic 
inheritance of the disease with a major locus at the major histocompatibility 
complex (MHC)^^'^^ The various susceptibility loci outside the MHC locus 
remain to be identified. hngnS is a strong candidate gene for the yet unmapped 

2 0 genes for insulin-dependent diabetes. It may also be involved in other human 

26 

diseases, Rostomily et ah have shown that the human neurogenin 1 homolog 
gene, NeuroD3, is expressed in primitive neuroectodermal tumors, especially in 
those that are highly metastatic. Thus, the analysis of hngn3 expression in 
neuroectodermal tumors and in pancreatic b-cells tumors, such as insulinomas, 

2 5 may indicate whether the disregulation of the hngn3 gene is involved in 

malignant transformation. 

The human homolog of neurogenin 3 {hngn3) fi'om human embryonic 
spinal cord has been now been identified by Applicants. Phylogenetic analysis 
showed that the neurogenin genes are homologous to the Drosophila Tap gene 

3 0 and that the atonal gene is certainly not the archetype of this gene family as was 
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previously suggested. A radiation hybrid panel was screened, leading to the 
mapping of the hngn3 gene to the 10q2L3 position. Early human embryos were 
used to investigate the pattern ofhngnS expression. Most importantly, the cells 
expressing hngnS were located in the same compartments of the spinal cord and 
5 pancreas as those expressing ngnS genes in rats and mice. Thus, these data 
support the idea of a functional conservation among the neurogenin 3 genes. 

The structure of the various neurogenin genes appears to be conserved, 
with, as described here for the hngnS gene, the entire coding sequence contained 
within a single exon. Similar genomic organizations have been reported for the 
1 0 other three mouse ngn genes, the Zebrafish ngnl gene and the mouse Mashl 
gene ' ' . The identification of a single intron interrupting the 5' untranslated 
region of the hngn3 gene is similar to the organization of that of two other 

12 16 

members of the ngn family, the mouse ngn2 and Zebrafish ngnl genes ' . The 
structures of the other known neurogenins have not yet been fully analyzed, 
1 5 However, the presence of an intron within the 5'UTR and the absence of introns 
within the coding region maybe common features of neurogenin genes. 

ABBREVIATIONS 

bp: base pairs, kbp: kilo base pairs, cR: centiRays 
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CLAIMS 

1 . An isolated nucleic acid comprising a polynucleotide sequence of 
SEQ ID NO: 10, or of a complementary polynucleotide sequence. 
5 2. An isolated nucleic acid comprising at least eight consecutive 

nucleotides of a polynucleotide sequence of SEQ ID NO: 1 0, or of a 
complementary polynucleotide sequence. 

3. An isolated nucleic acid comprising at least 80% nucleotide 
identity with a nucleic acid comprising SEQ ID NO: 10, or a complementary 

1 0 polynucleotide sequence. 

4. The isolated nucleic acid according to claim 3, wherein the 
nucleic acid comprises an 85%, 90%, 95%, or 98% nucleotide identity with the 
nucleic acid comprising SEQ ID NO: 10, or a complementary polynucleotide 
sequence. 

15 5. An isolated nucleic acid that hybridizes under high stringency 

conditions with a nucleic acid comprising SEQ ID NO: 10, or a complementary 
polynucleotide sequence. 

6. An isolated nucleic acid comprising a polynucleotide sequence as 
depicted in SEQ ID NO: 10, or of a complementary polynucleotide sequence. 

2 0 7. A nucleotide probe or primer specific for an ngn3 nucleic acid, 

wherein the nucleotide probe or primer comprises at least 15 consecutive 
nucleotides of a polynucleotide sequence of SEQ ID NO: 10, or of a 
complementary polynucleotide sequence. 

8. The nucleotide probe or primer according to claim 7, wherein the 
2 5 nucleotide probe or primer comprises a marker compound. 

9. A nucleotide probe or primer specific for an ngn3 nucleic acid, 
wherein the nucleotide probe or primer comprises SEQ ID NO: 10, or of a 
complementary polynucleotide sequence. 

10. The nucleotide probe or primer according to claim 9, wherein the 
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nucleotide probe or primer comprises a marker compound. 

11. A method of amplifying a region of the nucleic acid according to 
claim 1, wherein the method comprises: 

a) contacting the nucleic acid with two nucleotide primers, wherein the first 
5 nucleotide primer hybridizes at a position 5' of the region of the nucleic acid, and 
the second nucleotide primer hybridizes at a position 3' of the region of the nucleic 
acid, in the presence of reagents necessary for an amplification reaction; and 
i;3 b) detecting the amphfied nucleic acid region. 

; 12. The method according to claim 1 1 , wherein the two nucleotide 

10 primers are selected from the groxip consisting of 
n a) a nucleotide primer comprising at least 15 consecutive nucleotides of a 

Jl polynucleotide sequence of SEQ ED NO: 10, or of a complementary polynucleotide 

sequence, and 

Tl b) a nucleotide primer comprising a polynucleotide sequence of SEQ ID 

1 5 NO: 10, or of a complementary polynucleotide sequence. 
3 13. A kit for amplifying the nucleic acid according to claim 1 , wherein 

the kit comprises: 

a) two nucleotide primers whose hybridization position is located 
respectively 5' and 3' of the region of the nucleic acid; and optionally, 

2 0 b) reagents necessary for an amplification reaction. 

14, The kit according to claim 13, wherein the two nucleotide primers 
are selected fi*om the group consisting of 

a) a nucleotide primer comprising at least 15 consecutive nucleotides of a 
polynucleotide sequence of SEQ ID NO: 9, or of a complementary polynucleotide 

2 5 sequence, and 

b) a nucleotide primer comprising a polynucleotide sequence of any one of 
SEQ ID NOs: 9, 11, 12, 14, 15, 16, 17, 18, 19, 21, 23, 24, 25, or of a 
complementary polynucleotide sequence. 

15. A method of detecting a nucleic acid according to claim 1, wherein 

3 0 the method comprises: 
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a) contacting the nucleic acid with a nucleotide probe selected from the 
group consisting of 

1) a nucleotide probe comprising at least 15 consecutive nucleotides 
of a polynucleotide sequence of SEQ ID NO: 9, or of a complementary 

5 polynucleotide sequence, and 

2) a nucleotide probe comprising a polynucleotide sequence of any 
one ofSEQIDNOs: 9, 11, 12, 14, 15, 16, 17, 18, 19,21,23, 24, 25, or of a 
complementary polynucleotide sequence, and 

b) detecting a complex formed between the nucleic acid and the probe. 

10 16, The method of detection according to claim 15, wherein the probe is 

immobilized on a support. 

17. A kit for detecting the nucleic acid according to claim 1, wherein 
the kit comprises 

a) a nucleotide probe selected from the group consisting of 

15 1) a nucleotide probe comprising at least 15 consecutive nucleotides 

of a polynucleotide sequence of SEQ ID NO: 9, or of a complementary 
polynucleotide sequence, and 

2) a nucleotide primer comprising a polynucleotide sequence of any 
one of SEQ ID NOs: 9, 11, 12, 14, 15, 16, 17, 18, 19, 21, 23, 24, 25, or of a 

2 0 complementary polynucleotide sequence, and optionally, 

b) a reagent necessary for a hybridization reaction. 

1 8. The kit according to claim 17, wherein the probe is immobilized on 
a support. 

19. A recombinant vector comprising the nucleic acid according to 

2 5 claim 1 . 

20. The recombinant vector according to claim 19, wherein the 
recombinant vector is an adenovirus. 

21 . A recombinant vector comprising the nucleic acid according to 
claim 6. 

3 0 22. The recombinant vector according to claim 2 1 , wherein the 
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recombinant vector is an adenovirus. 

23. A recombinant host cell comprising the nucleic acid according to 
claim 1 , 

24. A recombinant host cell comprising the nucleic acid according to 
claim 6. 

25. A recombinant host cell comprising the recombinant vector 
according to claim 19. 

26. A recombinant host cell comprising the recombinant vector 
according to claim 21. 

27. An isolated nucleic acid encoding a polypeptide comprising an 
amino acid sequence of SEQ ID NO: 10. 

28. A recombinant vector comprising the nucleic acid according to 
claim 27. 

29. A recombinant host cell comprising the recombinant vector 
according to claim 28. 

30. A recombinant host cell comprising the nucleic acid according to 
claim 27. 

3 1 . An isolated polypeptide comprising an amino acid sequence of 
SEQ ID NO: 10. 

32. An antibody directed against the isolated polypeptide according to 
claim 31. 

33. The antibody according to claim 32, wherein the antibody comprises 
a detectable compound. 

34. An isolated polypeptide comprising an amino acid sequence as 
depicted in SEQ ID NO: 10. 

35. An antibody directed against the isolated polypeptide according to 
claim 34. 

36. The antibody according to claim 35, wherein the antibody comprises 
a detectable compound. 
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37. A method of detecting a polypeptide, wherein the method comprises 

a) contacting the polypeptide with an antibody according to claim 32; and 

b) detecting an antigen/antibody complex formed between the polypeptide 
and the antibody. 

5 38. A diagnostic kit for detecting a polypeptide, wherein the kit 

comprises 

a) the antibody according to claim 32; and 

b) a reagent allowing detection of an antigen/antibody complex formed 
between the polypeptide and the antibody. 

10 39. A pharmaceutical composition comprising the nucleic acid 

according to claim I and a physiologically compatible excipient. 

40. A pharmaceutical composition comprising the nucleic acid 
according to claim 6 and a physiologically compatible excipient. 

41 . A pharmaceutical composition comprising the recombinant vector 
15 according to claim 19 and a physiologically compatible excipient. 

42. A pharmaceutical composition comprising the recombinant vector 
according to claim 21 and a physiologically compatible excipient. 

43 . A pharmaceutical composition comprising the nucleic acid 
according to claim 27 and a physiologically compatible excipient. 

2 0 44. A pharmaceutical composition comprising the recombinant vector 

according to claim 28 and a physiologically compatible excipient. 

45. A pharmaceutical composition comprising the recombinant host cell 
according to claim 29 and a physiologically compatible excipient. 

46. A pharmaceutical composition comprising the recombinant host cell 

2 5 according to claim 30 and a physiologically compatible excipient. 

47. A pharmaceutical composition comprising the polypeptide 
according to claim 31 and a physiologically compatible excipient. 

48. A pharmaceutical composition comprising the polypeptide 
according to claim 34 and a physiologically compatible excipient. 

3 0 49. Use of the nucleic acid according to claim 1 for the manufacture of 
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a medicament intended for the prevention or treatment of a nervous system 
dysfunction. 

50, Use of the nucleic acid according to claim 6 for the manufacture of 
a medicament for the prevention or treatment of a platelet activation dysfunction. 
5 51. Use of the recombinant vector according to claim 1 9 for the 

manufacture of a medicament for the prevention or treatment of a nervous system 
dysfunction. 

52. Use of the recombinant vector according to claim 21 for the 
manufacture of a medicament intended for the prevention or treatment of a nervous 

1 0 system dysfunction, 

53. Use of the nucleic acid according to claim 27 for the manufacture of 
a medicament for the prevention or treatment of a nervous system dysfunction. 

54. Use of the recombinant vector according to claim 28 for the 
manufacture of a medicament for the prevention or treatment of a nervous system 

15 dysfimction. 

55. Use of the recombinant host cell according to claim 29 for the 
manufacture of a medicament for the prevention or treatment of a nervous system 
dysfunction. 

56. Use of the recombinant host cell according to claim 30 for the 

2 0 manufacture of a medicament for the prevention or treatment of a nervous system 

dysfunction. 

57. Use of the polypeptide according to claim 31 for the manufacture of 
a medicament intended for the prevention or treatment of a nervous system 
dysfunction. 

25 58. Use of the polypeptide according to claim 3 1 for screening an 

active ingredient for the prevention or treatment of a nervous system 
dysfunction. 

59, Use of a recombinant host cell expressing the polypeptide 
according to claim 31 for screening an active ingredient for the prevention or 

3 0 treatment of a nervous system dysfunction. 
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60. An implant comprising the recombinmt host cell according to claim 

23. 

61 . An implant comprising the recombinant host cell according to claim 

25. 

5 62. An implant comprising the recombinant host cell according to claim 

29. 

63. A method of identifying a modulator, agonist, or antagonist of a 
polypeptide according to the invention in a sample comprising 

a) obtaining a cell, for example a cell line, that, either naturally or after 
1 0 transfecting the cell with a nucleic acid encoding a polypeptide according to the 

invention, expresses a polypeptide according to the invention, 

b) transfecting the cell with a nucleic acid encoding a marker gene, 

c) incubating the cell of step b) with a test solution or sample comprising a 
potential modulator, agonist or antagonist, 

15 d) measuring the amount of p-galactosidase activity, and 

e) comparing the amount of p-galactosidase activity measured in step d) 
with an amount of p-galactosidase activity measured with a cell that has not been 
previously incubated in the presence of the candidate modulator, agonist, or 
antagonist compoimd for the polypeptide according to the invention. 

2 0 64. The method according to claim 63, wherein the polypeptide 

comprises an amino acid sequence of SEQ ID NO: 10. 

65. The method according to claim 63, wherein the nucleic acid encodes 
a p-galactosidase (p-gal) marker gene. 

66. Use of the polypeptide according to claim 31 to control and/or 
2 5 participate in the expression of a gene. 
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ABSTRACT OF THE INVENTION 



The present invention relates to nucleic acids that encode novel basic helix- 
loop-helix polypeptides. The present invention also relates to novel basic helix- 
loop-helix polypeptides, in particular, rat Relax and human ngn3 polypeptides. 
The invention also relates to means for the detection of nucleic acids and 
polypeptides according to the invention. The invention also relates to methods for 
the detection of activators or inhibitors of the polypeptides of the invention. 
Finally, the present invention relates to methods of prevention and/or treatment of 
pathologies, dysfunctions, disorders or conditions affecting the nervous system. 
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SEQUENCE LISTING 



<110> ICARD-LIEPKALNS, Christine 
MALLET, Jacques 
RAVASSARD, Philippe 

<120> POLYPEPTIDES OF THE "BASIC-HELIX-LOOP-HELIX" bHLH 
FAMILY, CORRESPONDING NUCLEIC ACID SEQUENCES 

<130> ST96042A-US 

<140> US 9/331,356 
<141> 1999-06-18 

<150> FR96/15651 
<151> 1996-12-19 

<150> PCT/FR97/02368 
<151> 1997-12-19 

<160> 28 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 1460 
<212> DNA 

<213> Rattus norvegicus 
<400> 1 

gcaggtagcg agaggagcag tccctgggcc cccgttgctg attggcccgt ggcacaggca 
60 

gcagcccggc aggcacgctc ctggtccggg cagagcagat aaagcgtgcc aggggacaca 
12 0 

cgattagcag ctcagaagtc cctctgggtc tcaccactgc acagaggccg aggaccccct 
180 

ccgagcttct ttgctgcctc cagacgcaat ttactccagg cgagggcgcc tgcagctcag 
240 

caaaacttcg aagcgagcag aggggttcag ctatccaccg ctgcttgact ctgaccaccc 
300 

gcagctctct gttcttttga gcccggagta actaggtaac atttaggaac ctccaaaggg 
360 

tagaagaggg gagtgggtgg gcgtactcta gtcccgcgtg gagtgacctc taagtcagag 
420 

actgtcacac cccccttcca ttttttccca acctcaggat ggcgcctcat cccttggatg 
480 

cgcccaccat ccaagtgtcc caagagaccc agcaaccctt tcccggagcc tcggaccacg 
540 

aagtgctcag ttccaattcc accccaccta gccccactct cgtaccgagg gactgctccg 
600 

aagcagaagc aggtgactgc cgagggacat cgaggaagct ccgtgcgcgg cgcggagggc 
660 

gcaacaggcc caagagcgag ttggcactga gcaagcagcg acgaagccgg cgcaagaagg 
720 

ccaacgaccg ggagcgcaac cgcatgcaca accttaactc cgcgctggat gcgctgcgcg 
780 



gtgtcctgcc 
840 

cccacaacta 
900 

gccccgagcc 
960 

actggggctc 
1020 

tggaggagtt 
1080 

tggtgttctc 
1140 

aaagggaggg 
1200 

cccttctggc 
1260 

gctgtgcgtg 
1320 

acctcaaact 
1380 

atacttcctg 
1440 

agagtgacct 
1460 



caccttcccg 
catttgggca 
ccctgtgccc 
tatctactcc 
ccctggcctg 
agacttcttg 
agtcagagct 
tttcattagt 
cacaaaggac 
cccgctccaa 
gtgactccgc 
aatccagtgt 



gatgacgcca 
ctgactcaga 
tgtggggagc 
ccagtttccc 
caggtgccca 
tgaagggccc 
gtctgaaatg 
caggtccctg 
attgcaggct 
gcagaggaga 
cctctttcaa 



aacttacaaa 
cgctgcgcat 
tgggaagccc 
aagctggtag 
gctccccatc 
aaacaggccc 
gaaggtagtg 
atttaaccag 
gatctcctct 
gccgtagcac 
atctgcgggc 



gatcgagacc 
agcggaccac 

gggagggggc 
cctgagcccc 
ctgtctgctc 
tgggcggtgg 
gaggcactcg 
gattcgcaca 
taaccctcct 
taaatagttg 
ctccaaccac 



ctgcgcttcg 
agcttctacg 
tccagcggcg 
acagcctcat 
ccgggcaccc 
gcgctggcag 
agcatctcgc 
gttccttgct 
cagtgtggcc 
ggagactccc 
cgctttctcc 



<210> 2 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR Primer 
<220> 

<223> n - Inosine 
<400> 2 

aatkhgrngng agcgcndkcg cryg 
24 



<210> 3 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR Primers 
<400> 3 

ggcsrdtytc agggtsybga yctt 
24 



<210> 4 
<211> 25 



<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PGR Primers 
<400> 4 

aaccttaact ccgcgctgga tgcgc 
25 



<210> 5 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR Primers 
<400> 5 

cgcggtgtcc tgcccacc 
18 



<210> 6 
<211> 6 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: E box 

<400> 6 

caggtg 

6 



<210> 7 
<211> 6 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Mutated E box 

<400> 7 

tccgtg 

6 



<210> 8 
<211> 214 
<212> PRT 

<213> Rattus norvegicus 
<400> 8 

Met Ala Pro His Pro Leu Asp Ala Pro Thr lie Gin Val Ser Gin Glu 



1 



5 



10 



15 



Thr Gin Gin Pro Phe Pro Gly Ala Ser Asp His Glu Val Leu Ser Ser 
20 25 30 

Asn Ser Thr Pro Pro Ser Pro Thr Leu Val Pro Arg Asp Cys Ser Glu 
35 40 45 

Ala Glu Ala Gly Asp Cys Arg Gly Thr Ser Arg Lys Leu Arg Ala Arg 
50 55 60 

Arg Gly Gly Arg Asn Arg Pro Lys Ser Glu Leu Ala Leu Ser Lys Gin 
65 70 75 80 

Arg Arg Ser Arg Arg Lys Lys Ala Asn Asp Arg Glu Arg Asn Arg Met 

85 90 95 

His Asn Leu Asn Ser Ala Leu Asp Ala Leu Arg Gly Val Leu Pro Thr 
100 105 110 

Phe Pro Asp Asp Ala Lys Leu Thr Lys lie Glu Thr Leu Arg Phe Ala 
115 120 125 

His Asn Tyr He Trp Ala Leu Thr Gin Thr Leu Arg He Ala Asp His 
130 135 140 

Ser Phe Tyr Gly Pro Glu Pro Pro Val Pro Cys Gly Glu Leu Gly Ser 
145 150 155 160 

Pro Gly Gly Gly Ser Ser Gly Asp Trp Gly Ser He Tyr Ser Pro Val 

165 170 175 

Ser Gin Ala Gly Ser Leu Ser Pro Thr Ala Ser Leu Glu Glu Phe Pro 
180 185 190 

Gly Leu Gin Val Pro Ser Ser Pro Ser Cys Leu Leu Pro Gly Thr Leu 
195 200 205 

Val Phe Ser Asp Phe Leu 
210 



<210> 9 

<211> 1330 

<212> DNA 

<213> Homo sapiens 

<400> 9 

cctcggaccc cattctctct tcttttctcc tttggggctg gggcaactcc caggcggggg 
60 

cgcctgcagc tcagctgaac ttggcgacca gaagcccgct gagctcccca cggccctcgc 
120 

tgctcatcgc tctctattct tttgcgccgg tagaaaggta atatttggag gccttcgagg 
180 

gacgggcagg ggaaagaggg atcctctgac ccagcggggg ctgggaggat ggctgttttt 
240 



gttttttccc 
300 

tccctctgac 
360 

acccgtgaga 
420 

tccgccccgc 
480 

tgccgagggg 
540 

gagttggcac 
600 

aatcgaatgc 
660 

ccagacgacg 
720 

gcgctgactc 
780 

ccgcactgcg 
840 

tccccagtct 
900 

ctgctggggg 
960 

ctgtgaaagg 
1020 

gccgggagcc 
1080 

ttctccctag 
1140 

tccccggccc 
1200 

gcattgcaaa 
1260 

aagaatacta 
1320 

ccctcactca 
1330 



acctagcctc 
cccgccgtag 
cggagcggtc 
ccagccccac 
ccccgaggaa 
tgagcaagca 
acgacctcaa 
cgaagctcac 
aaacgctgcg 
gggagctggg 
cccaggctgg 
ccacctcttc 
acctgtctgt 
gtagagggtg 
ctgacccctg 
tccgagccgc 
gtgcgctcat 
gaatggtagc 



ggaatcgcgg 
gatgacgcct 
cttccccaga 
tcgcacaccg 
gctccgggca 
gcgacggagt 
ctcggcactg 
caagatcgag 
catagcggac 
cagcccaggc 
cagcctgagt 
cgcctgcttg 
cgctgggctg 
gccgacggcg 
gccggcccag 
gccaacgcac 
tttaggcctc 
actacccggc 



actgcgccgt 
caaccctcgg 
gcctcggaag 
gggaactgcg 
cggcgcgggg 
cggcgaaaga 
gacgccctgc 
acgctgcgct 
cacagcttgt 
ggtccccccg 
cccgccgcgt 
agcccaggca 
tgggtgctaa 
gcggccctca 
gcctccacgg 
gcaacccttg 
ctctctgcca 
cggagccgcc 



gacggactca 
gtgcgcccac 
acgaagtgac 
cagaggcgga 
gacgcagccg 
aggccaacga 
gcggtgtcct 
tcgcccacaa 
acgcgctgga 
gggactgggg 
cgctggagga 
gtctggcttt 
gggtaaggga 
aaagcacttg 
gggcggtagg 
ctgctgcccg 
ccaccccata 
caccgtcttg 



aacttaccct 
tgtccaagtg 
ctgccccacg 
agagggaggc 
gcctaagagc 
ccgcgagcgc 
gcccaccttc 
ctacatctgg 
gccgccggcg 
gtccctctac 
gcgacccggg 
ctcagatttt 
gagggaggga 
ttccttctgc 
ctgggttcat 
cgcgaagtgg 
atcccattca 
ggtcgcccta 



<210> 10 
<211> 214 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Met Thr Pro Gin Pro Ser Gly Ala Pro Thr Val Gin Val Thr Arg Glu 
15 10 15 

Thr Glu Arg Ser Phe Pro Arg Ala Ser Glu Asp Glu Val Thr Cys Pro 
20 25 30 

Thr Ser Ala Pro Pro Ser Pro Thr Arg Thr Pro Gly Asn Cys Ala Glu 
35 40 45 



Ala Glu Glu Gly Gly Cys Arg Gly Ala Pro Arg Lys Leu Arg Ala Arg 
50 55 60 



Arg Gly Gly Arg Ser Arg Pro Lys Ser Glu Leu Ala Leu Ser Lys Gin 
65 70 75 80 



Arg Arg Ser Arg Arg Lys Lys Ala Asn Asp Arg Glu Arg Asn Arg Met 

85 90 95 

His Asp Leu Asn Ser Ala Leu Asp Ala Leu Arg Gly Val Leu Pro Thr 
100 105 110 

Phe Pro Asp Asp Ala Lys Leu Thr Lys lie Glu Thr Leu Arg Phe Ala 
115 120 125 

His Asn Tyr lie Trp Ala Leu Thr Gin Thr Leu Arg lie Ala Asp His 
130 135 140 

Ser Leu Tyr Ala Leu Glu Pro Pro Ala Pro His Cys Gly Glu Leu Gly 
145 150 155 160 

Ser Pro Gly Gly Pro Pro Gly Asp Trp Gly Ser Leu Tyr Ser Pro Val 

165 170 175 

Ser Gin Ala Gly Ser Leu Ser Pro Ala Ala Ser Leu Glu Glu Arg Pro 
180 185 190 

Gly Leu Leu Gly Ala Thr Ser Ser Ala Cys Leu Ser Pro Gly Ser Leu 
195 200 205 



Ala Phe Ser Asp Phe Leu 
210 



<210> 11 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR Primer 
<400> 11 

caacgaccgg cagcgcaa 
18 



<210> 12 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR Primer 
<400> 12 

gcccagatgt agttgtgggc gaag 
24 



<210> 13 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR Primer 
<400> 13 

atcgttgaga ctcgtaccag cagagtcacg agagagacta cacggtactg gnnnnnnnnn 
60 



<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR Primer 
<400> 14 

agacgacgcg aagctcacca 
20 



<210> 15 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR Primer 
<400> 15 

gctcaccaag atcgagacgc tgcg 
24 



<210> 16 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR Primer 
<400> 16 

atcgttgaga ctcgtaccag cagag 
25 



<210> 17 

<211> 25 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PGR Primer 
<400> 17 

tcgtaccagc agagtcacga gagag 
25 



<210> 18 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR Primer 
<400> 18 

ctgccagcct gggagactg 
19 



<210> 19 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR Primer 
<400> 19 

ctgcatctat ctaatgctcc tctcgctacc tgctcactct gcgtgacatc 
50 



<210> 20 
<211> 25 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR Primer 
<400> 20 

gatgtcacgc agagtgagca ggtag 
25 



<210> 21 
<211> 23 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PCR Primer 



agcctgggag actggggagt aga 
23 



<210> 22 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Description of Artificial Sequence: PCR Primer 
<400> 22 

agagtgagca ggtagcgaga ggag 
24 



<210> 23 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR Primer 
<400> 23 

cgctatgcgc agcgtttgag tc 
22 



<210> 24 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR Primer 
<400> 24 

cctcggaccc cattctctct tcttt 
25 



<210> 25 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<:223> Description of Artificial Sequence: PCR Primer 
<400> 25 

tgagtgaggg tagggcgacc caag 
24 



<210> 26 



<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probe 
<400> 26 

aggaagctcc gggca 
15 



<210> 27 
<211> 1381 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probe 
<400> 27 

gggcgaauug ggcccgacgu cgcaugcucc cggccgccau ggccgcggga uuugagugag 
60 

gguagggcga cccaagacgg ugggcggcuc cggccgggua gugcuaccau ucuaguauuc 
120 

uuugaauggg auuauggggu gguggcagag aggaggccua aaaugagcgc acuuugcaau 
180 

gcccacuucg cgcgggcagc agcaaggguu gcgugcguug gcgcggcucg gagggccggg 
240 

gaaugaaccc agccuaccgc ccccguggag gccugggccg gccagggguc agcuagggag 
300 

aagcagaagg aacaagugcu uuugagggcc gccgccgucg gccacccucu acggcucccg 
360 

gcucccuccc ucucccuuac ccuuagcacc cacagcccag cgacagacag guccuuucac 
420 

agaaaaucug agaaagccag acugccuggg cucaagcagg cggaagaggu ggcccccagc 
480 

agcccggguc gcuccuccag cgacgcggcg ggacucaggc ugccagccug ggagacuggg 
540 

gaguagaggg acccccaguc cccgggggga ccgccugggc ugcccagcuc cccgcagugc 
600 

ggcgccggcg gcuccagcgc guacaagcug ugguccgcua ugcgcagcgu uugagucagc 
660 

gcccagaugu aguugugggc gaagcgcagc gucucgaucu uggugagcuu cgcgucgucu 
720 

gggaaggugg gcaggacacc gcgcagggcg uccagugccg aguugagguc gugcauucga 
780 

uugcgcucgc ggucguuggc cuucuuucgc cgacuccguc gcugcuugcu cagugccaac 
840 

ucgcucuuag gccggcugcg ucccccgcgc cgugcccgga gcuuccucgg ggccccucgg 
900 

cagccucccu cuuccgccuc ugcgcaguuc cccggugugc gaguggggcu gggcggggcg 
960 

gacguggggc aggucacuuc gucuuccgag gcucugggga aggaccgcuc cgucucacgg 
1020 

ucacuuggac agugggcgca cccgaggguu gaggcgucau ccuacggcgg ggucagaggg 
1080 



aaggguaagu uugaguccgu 
1140 

caaaaacagc cauccuccca 
1200 

cccucgaagg ccuccaaaua 
1260 

agcgagggcc guggggagcu 
1320 

gcccccgccu gggaguugcc 
1380 

g 

1381 



cacggcgcag uccgcgauuc 
gcccccgcug ggucagagga 
uuaccuuucu accggcgcaa 
cagcgggcuu cuggucgcca 
ccagccccaa aggagaaaag 



cgaggcuagg ugggaaaaaa 
ucccucuuuc cccugcccgu 
aagaauagag agcgaugagc 
aguucagcug agcugcaggc 
aagagagaau gggguccgag 



<210> 28 
<211> 1427 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probe 
<400> 28 

agcuaugcau ccaacgcguu gggagcucuc ccauaugguc gaccugcagg cggccgcgaa 
60 

uucacuagug auuccucgga ccccauucuc ucuucuuuuc uccuuugggg cuggggcaac 
120 

ucccaggcgg gggcgccugc agcucagcug aacuuggcga ccagaagccc gcugagcucc 
180 

ccacggcccu cgcugcucau cgcucucuau ucuuuugcgc cgguagaaag guaauauuug 
240 

gaggccuucg agggacgggc aggggaaaga gggauccucu gacccagcgg gggcugggag 
300 

gauggcuguu uuuguuuuuu cccaccuagc cucggaaucg cggacugcgc cgugacggac 
360 

ucaaacuuac ccuucccucu gaccccgccg uaggaugacg ccucaacccu cgggugcgcc 
420 

cacuguccaa gugacccgug agacggagcg guccuucccc agagccucgg aagacgaagu 
480 

gaccugcccc acguccgccc cgcccagccc cacucgcaca ccggggaacu gcgcagaggc 
540 

ggaagaggga ggcugccgag gggccccgag gaagcuccgg gcacggcgcg ggggacgcag 
600 

ccggccuaag agcgaguugg cacugagcaa gcagcgacgg agucggcgaa agaaggccaa 
660 

cgaccgcgag cgcaaucgaa ugcacgaccu caacucggca cuggacgccc ugcgcggugu 
720 

ccugcccacc uucccagacg acgcgaagcu caccaagauc gagacgcugc gcuucgccca 
780 

caacuacauc ugggcgcuga cucaaacgcu gcgcauagcg gaccacagcu uguacgcgcu 
840 

ggagccgccg gcgccgcacu gcggggagcu gggcagccca ggcggucccc ccggggacug 
900 

9999^<^ccuc uacuccccag ucucccaggc uggcagccug agucccgccg cgucgcugga 
960 

ggagcgaccc gggcugcugg gggccaccuc uuccgccugc uugagcccag gcagucuggc 
1020 



uuucucagau uuucugugaa aggaccuguc ugucgcuggg cugugggugc uaaggguaag 
1080 

ggagagggag ggagccggga gccguagagg guggccgacg gcggcggccc ucaaaagcac 
1140 

uuguuccuuc ugcuucuccc uagcugaccc cuggccggcc caggccucca cgggggcggu 
1200 

aggcuggguu cauuccccgg cccuccgagc cgcgccaacg cacgcaaccc uugcugcugc 
1260 

ccgcgcgaag ugggcauugc aaagugcgcu cauuuuaggc cuccucucug ccaccacccc 
1320 

auaaucccau ucaaagaaua cuagaauggu agcacuaccc ggccggagcc gcccaccguc 
1380 

^^999^cgcc cuacccucac ucaaaucgaa uucccgcggc cgccaug 
1427 



